Growth differentiation factor-8 (GDF-8) and polynucleotides encoding same

ABSTRACT

Growth differentiation factor-8 (GDF-8) polypeptides, polynucleotides encoding GDF-8 polypeptides, and vectors and host cells containing GDF-8 encoding polynucleotides are provided.

This application is a 371 application of PCT US94/03019, filed Mar. 18,1994, and a continuation-in-part application of the U.S. applicationSer. No. 08/033,923 filed on Mar. 19, 1993. now abandoned.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The invention relates generally to growth factors and specifically to anew member of the transforming growth factor beta (TGF-β) superfamily,which is denoted, growth differentiation factor-8 (GDF-8).

2. Description of Related Art

The transforming growth factor β (TGF-β) superfamily encompasses a groupof structurally-related proteins which affect a wide range ofdifferentiation processes during embryonic development. The familyincludes, Mullerian inhibiting substance (MIS), which is required fornormal male sex development (Behringer, et al., Nature, 345:167, 1990),Drosophila decapentaplegic (DPP) gene product, which is required fordorsal-ventral axis formation and morphogenesis of the imaginal disks(Padgett, et al., Nature, 325:81-84, 1987), the Xenopus Vg-1 geneproduct, which localizes to the vegetal pole of eggs ((Weeks, et al.,Cell, 51:861-867, 1987), the activins (Mason, et al., Biochem, Biophys.Res. Commun., 135:957-964, 1986), which can induce the formation ofmesoderm and anterior structures in Xenopus embryos (Thomsen, et al.,Cell, 63:485, 1990), and the bone morphogenetic proteins (BMPs,osteogenin, OP-1) which can induce de novo cartilage and bone formation(Sampath, et al., J. Biol. Chem., 265:13198, 1990). The TGF-βs caninfluence a variety of differentiation processes, includingadipogenesis, myogenesis, chondrogenesis, hematopoiesis, and epithelialcell differentiation (for review, see Massague, Cell 49:437, 1987).

The proteins of the TGF-β family are initially synthesized as a largeprecursor protein which subsequently undergoes proteolytic cleavage at acluster of basic residues approximately 110-140 amino acids from theC-terminus. The C-terminal regions, or mature regions, of the proteinsare all structurally related and the different family members can beclassified into distinct subgroups based on the extent of theirhomology. Although the homologies within particular subgroups range from70% to 90% amino acid sequence identity, the homologies betweensubgroups are significantly lower, generally ranging from only 20% to50%. In each case, the active species appears to be a disulfide-linkeddimer of C-terminal fragments. Studies have shown that when thepro-region of a member of the TGF-β family is coexpressed with a matureregion of another member of the TGF-β family, intracellular dimerizationand secretion of biologically active homodimers occur (Gray, A., andMaston, A., Science, 247:1328, 1990). Additional studies by Hammonds, etal., (Molec. Endocrin. 5:149, 1991) showed that the use of the BMP-2pro-region combined with the BMP-4 mature region led to dramaticallyimproved expression of mature BMP-4. For most of the family members thathave been studied, the homodimeric species has been found to bebiologically active, but for other family members, like the inhibins(Ling, et al., Nature, 321:779, 1986) and the TGF-βs (Cheifetz, et al.,Cell, 48:409, 1987), heterodimers have also been detected, and theseappear to have different biological properties than the respectivehomodimers.

Identification of new factors that are tissue-specific in theirexpression pattern will provide a greater understanding of that tissue'sdevelopment and function.

SUMMARY OF THE INVENTION

-The present invention provides a cell growth and differentiationfactor, GDF-8, a polynucleotide sequence which encodes the factor, andantibodies which are immunoreactive with the factor. This factor appearsto relate to various cell proliferative disorders, especially thoseinvolving those involving muscle, nerve, and adipose tissue.

Thus, in one embodiment, the invention provides a method for detecting acell proliferative disorder of muscle, nerve, or fat origin and which isassociated with GDF-8. In another embodiment, the invention provides amethod for treating a cell proliferative disorder by suppressing orenhancing GDF-8 activity.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a Northern blot showing expression of GDF-8 mRNA in adulttissues. The probe was a partial murine GDF-8 clone.

FIGS. 2(a-b) shows nucleotide and predicted amino acid sequences ofmurine GDF-8 (FIG. 2a; SEQ ID NOS:5 and 6, respectively) and human GDF-8(FIG. 2b; SEQ ID NOS:7 and 8, respectively). The putative dibasicprocessing sites in the murine sequence are boxed.

FIG. 3 shows the alignment of the C-terminal sequences of GDF-8 withother members of the TGF-β superfamily (SEQ ID NOS:14 and 18-31). Theconserved cysteine residues are boxed. Dashes denote gaps introduced inorder to maximize alignment.

FIG. 4 shows amino acid homologies among different members of the TGF-βsuperfamily. Numbers represent percent amino acid identities betweeneach pair calculated from the first conserved cysteine to theC-terminus. Boxes represent homologies among highly-related memberswithin particular subgroups.

FIGS. 5(a-d) shows the sequence of GDF-8. Nucleotide and amino acidsequences of murine (FIG. 5a and FIG. 5b) and human (FIGS. 5c and 5d)GDF-8 cDNA clones are shown. Numbers indicate nucleotide positionrelative to the 5' end. Consensus N-linked glycosylation signals areshaded. The putative RXXR (SEQ ID NO:32) proteolytic cleavage sites areboxed.

FIGS. 6(a-b) shows a hydropathicity profile of GDF-8. Averagehydrophobicity values for murine (FIG. 6a) and human (FIG. 6b) GDF-8were calculated using the method of Kyte and Doolittle (J. Mol. Biol.,157:105-132, 1982). Positive numbers indicate increasing hydrophobicity.

FIG. 7 shows a comparison of murine and human GDF-8 amino acid sequences(SEQ ID NOS:12 and 14, respectively). The predicted murine sequence isshown in the top lines and the predicted human sequence is shown in thebottom lines. Numbers indicate amino acid position relative to theN-terminus. Identities between the two sequences are denoted by avertical line.

FIG. 8 shows the expression of GDF-8 in bacteria. BL21 (DE3) (pLysS)cells carrying a pRSET/GDF-8 expression plasmid were induced withisopropylthio-β-galactoside, and the GDF-8 fusion protein was purifiedby metal chelate chromatography. Lanes: total=total cell lysate;soluble=soluble protein fraction; insoluble=insoluble protein fraction(resuspended in 10 mM Tris pH 8.0, 50 mM sodium phosphate, 8M urea, and10 mM β-mercaptoethanol buffer B!) loaded onto the column;pellet=insoluble protein fraction discarded before loading the column;flowthrough=proteins not bound by the column; washes=washes carried outin buffer B at the indicated pH's. Positions of molecular weightstandards are shown at the right. Arrow indicates the position of theGDF-8 fusion protein.

FIG. 9 shows the expression of GDF-8 in mammalian cells. Chinese hamsterovary cells were transfected with pMSXND/GDF-8 expression plasmids andselected in G418. Conditioned media from G418-resistant cells (preparedfrom cells transfected with constructs in which GDF-8 was cloned ineither the antisense or sense orientation) were concentrated,electrophoresed under reducing conditions, blotted, and probed withanti-GDF-8 antibodies and ¹²⁵ I!iodoproteinA. Arrow indicates theposition of the processed GDF-8 protein.

FIGS. 10(a-b) shows the expression of GDF-8 mRNA. Poly A-selected RNA (5μg each) prepared from adult tissues (FIG. 1Oa) or placentas and embryos(FIG. 10b) at the indicated days of gestation was electrophoresed onformaldehyde gels, blotted, and probed with full length murine GDF-8.

FIG. 11 shows chromosomal mapping of human GDF-8. DNA samples preparedfrom human/rodent somatic cell hybrid lines were subjected to PCR,electrophoresed on agarose gels, blotted, and probed. The humanchromosome contained in each of the hybrid cell lines is identified atthe top of each of the first 24 lanes (1-22, X, and Y). In the lanesdesignated M, CHO, and H, the starting DNA template was total genomicDNA from mouse, hamster, and human sources, respectively. In the lanemarked B1, no template DNA was used. Numbers at left indicate themobilities of DNA standards.

DETAILED DESCRIPTION OF THE INVENTION

The present invention provides a growth and differentiation factor,GDF-8 and a polynucleotide sequence encoding GDF-8. GDF-8 is expressedat highest levels in muscle and at lower levels in adipose tissue. Inone embodiment, the invention provides a method for detection of a cellproliferative disorder of muscle, nerve, or fat origin which isassociated with GDF-8 expression. In another embodiment, the inventionprovides a method for treating a cell proliferative disorder by using anagent which suppresses or enhances GDF-8 activity.

The TGF-β superfamily consists of multifunctional polypeptides thatcontrol proliferation, differentiation, and other functions in many celltypes. Many of the peptides have regulatory, both positive and negative,effects on other peptide growth factors. The structural homology betweenthe GDF-8 protein of this invention and the members of the TGF-β family,indicates that GDF-8 is a new member of the family of growth anddifferentiation factors. Based on the known activities of many of theother members, it can be expected that GDF-8 will also possessbiological activities that will make it useful as a diagnostic andtherapeutic reagent.

In particular, certain members of this superfamily have expressionpatterns or possess activities that relate to the function of thenervous system. For example, the inhibins and activins have been shownto be expressed in the brain (Meunier, et al., Proc. Natl. Acad. Sci.,U.S.A., 85:247, 1988; Sawchenko, et al., Nature, 334:615, 1988), andactivin has been shown to be capable of functioning as a nerve cellsurvival molecule (Schubert, et al., Nature, 344:868, 1990). Anotherfamily member, namely, GDF-1, is nervous system-specific in itsexpression pattern (Lee, S. J., Proc. Natl. Acad. Sci., U.S.A., 88:4250,1991), and certain other family members, such as Vgr-1 (Lyons, et al.,Proc. Natl Acad. Sci., U.S.A., 86:4554, 1989; Jones, et al.,Development, 111:531, 1991), OP-1 (Ozkaynak, et al., J. Biol. Chem.,267:25220, 1992), and BMP-4 (Jones, et al., Development, 111:531, 1991),are also known to be expressed in the nervous system. Because it isknown that skeletal muscle produces a factor or factors that promote thesurvival of motor neurons (Brown, Trends Neurosci., 7:10, 1984), theexpression of GDF-8 in muscle suggests that one activity of GDF-8 may beas a trophic factor for neurons. In this regard, GDF-8 may haveapplications in the treatment of neurodegenerative diseases, such asamyotrophic lateral sclerosis, or in maintaining cells or tissues inculture prior to transplantation.

GDF-8 may also have applications in treating disease processes involvingmuscle, such as in musculodegenerative diseases or in tissue repair dueto trauma. In this regard, many other members of the TGF-β family arealso important mediators of tissue repair. TGF-β has been shown to havemarked effects on the formation of collagen and to cause a strikingangiogenic response in the newborn mouse (Roberts, et al., Proc. Natl.Acad. Sci., U.S.A. 83:4167, 1986). TGF-β has also been shown to inhibitthe differentiation of myoblasts in culture (Massague, et al., Proc.Natl. Acad. Sci., U.S.A. 83:8206, 1986). Moreover, because myoblastcells may be used as a vehicle for delivering genes to muscle for genetherapy, the properties of GDF-8 could be exploited for maintainingcells prior to transplantation or for enhancing the efficiency of thefusion process.

The expression of GDF-8 in adipose tissue also raises the possibility ofapplications for GDF-8 in the treatment of obesity or of disordersrelated to abnormal proliferation of adipocytes. In this regard, TGF-βhas been shown to be a potent inhibitor of adipocyte differentiation invitro (Ignotz and Massague, Proc. Natl. Acad. Sci., U.S.A. 82:8530,1985).

The term "substantially pure" as used herein refers to GDF-8 which issubstantially free of other proteins, lipids, carbohydrates or othermaterials with which it is naturally associated. One skilled in the artcan purify GDF-8 using standard techniques for protein purification. Thesubstantially pure polypeptide will yield a single major band on anon-reducing polyacrylamide gel. The purity of the GDF-8 polypeptide canalso be determined by amino-terminal amino acid sequence analysis. GDF-8polypeptide includes functional fragments of the polypeptide, as long asthe activity of GDF-8 remains. Smaller peptides containing thebiological activity of GDF-8 are included in the invention.

The invention provides polynucleotides encoding the GDF-8 protein. Thesepolynucleotides include DNA, cDNA and RNA sequences which encode GDF-8.It is understood that all polynucleotides encoding all or a portion ofGDF-8 are also included herein, as long as they encode a polypeptidewith GDF-8 activity. Such polynucleotides include naturally occurring,synthetic, and intentionally manipulated polynucleotides. For example,GDF-8 polynucleotide may be subjected to site-directed mutagenesis. Thepolynucleotide sequence for GDF-8 also includes antisense sequences. Thepolynucleotides of the invention include sequences that are degenerateas a result of the genetic code. There are 20 natural amino acids, mostof which are specified by more than one codon. Therefore, all degeneratenucleotide sequences are included in the invention as long as the aminoacid sequence of GDF-8 polypeptide encoded by the nucleotide sequence isfunctionally unchanged.

Specifically disclosed herein is a genomic DNA sequence containing aportion of the GDF-8 gene. The sequence contains an open reading framecorresponding to the predicted C-terminal region of the GDF-8 precursorprotein. The encoded polypeptide is predicted to contain two potentialproteolytic processing sites (KR and RR). Cleavage of the precursor atthe downstream site would generate a mature biologically activeC-terminal fragment of 109 amino acids with a predicted molecular weightof approximately 12,400. Also, disclosed are full length murine andhuman GDF-8 cDNA sequences. The murine pre-pro-GDF-8 protein is 376amino acids in length, which is encoded by a 2676 base pair nucleotidesequence, beginning at nucleotide 104 and extending to a TGA stop codonat nucleotide 1232. The human GDF-8 protein is 375 amino acids and isencoded by a 2743 base pair sequence, with the open reading framebeginning at nucleotide 59 and extending to nucleotide 1184.

The C-terminal region of GDF-8 following the putative proteolyticprocessing site shows significant homology to the known members of theTGF-β superfamily. The GDF-8 sequence contains most of the residues thatare highly conserved in other family members (see FIG. 3). Like theTGF-βs and inhibin βs, GDF-8 contains an extra pair of cysteine residuesin addition to the 7 cysteines found in virtually all other familymembers. Among the known family members, GDF-8 is most homologous toVgr-1 (45% sequence identity) (see FIG. 4).

Minor modifications of the recombinant GDF-8 primary amino acid sequencemay result in proteins which have substantially equivalent activity ascompared to the GDF-8 polypeptide described herein. Such modificationsmay be deliberate, as by site-directed mutagenesis, or may bespontaneous. All of the polypeptides produced by these modifications areincluded herein as long as the biological activity of GDF-8 stillexists. Further, deletion of one or more amino acids can also result ina modification of the structure of the resultant molecule withoutsignificantly altering its biological activity. This can lead to thedevelopment of a smaller active molecule which would have broaderutility. For example, one can remove amino or carboxy terminal aminoacids which are not required for GDF-8 biological activity.

The nucleotide sequence encoding the GDF-8 polypeptide of the inventionincludes the disclosed sequence and conservative variations thereof. Theterm "conservative variation" as used herein denotes the replacement ofan amino acid residue by another, biologically similar residue. Examplesof conservative variations include the substitution of one hydrophobicresidue such as isoleucine, valine, leucine or methionine for another,or the substitution of one polar residue for another, such as thesubstitution of arginine for lysine, glutamic for aspartic acid, orglutamine for asparagine, and the like. The term "conservativevariation" also includes the use of a substituted amino acid in place ofan unsubstituted parent amino acid provided that antibodies raised tothe substituted polypeptide also immunoreact with the unsubstitutedpolypeptide.

DNA sequences of the invention can be obtained by several methods. Forexample, the DNA can be isolated using hybridization techniques whichare well known in the art. These include, but are not limited to: 1)hybridization of genomic or cDNA libraries with probes to detecthomologous nucleotide sequences, 2) polymerase chain reaction (PCR) ongenomic DNA or cDNA using primers capable of annealing to the DNAsequence of interest, and 3) antibody screening of expression librariesto detect cloned DNA fragments with shared structural features.

Preferably the GDF-8 polynucleotide of the invention is derived from amammalian organism, and most preferably from a mouse, rat, or human.Screening procedures which rely on nucleic acid hybridization make itpossible to isolate any gene sequence from any organism, provided theappropriate probe is available. Oligonucleotide probes, which correspondto a part of the sequence encoding the protein in question, can besynthesized chemically. This requires that short, oligopeptide stretchesof amino acid sequence must be known. The DNA sequence encoding theprotein can be deduced from the genetic code, however, the degeneracy ofthe code must be taken into account. It is possible to perform a mixedaddition reaction when the sequence is degenerate. This includes aheterogeneous mixture of denatured double-stranded DNA. For suchscreening, hybridization is preferably performed on eithersingle-stranded DNA or denatured double-stranded DNA. Hybridization isparticularly useful in the detection of cDNA clones derived from sourceswhere an extremely low amount of mRNA sequences relating to thepolypeptide of interest are present. In other words, by using stringenthybridization conditions directed to avoid non-specific binding, it ispossible, for example, to allow the autoradiographic visualization of aspecific cDNA clone by the hybridization of the target DNA to thatsingle probe in the mixture which is its complete complement (Wallace,et al., Nucl. Acid Res., 9:879, 1981).

The development of specific DNA sequences encoding GDF-8 can also beobtained by: 1) isolation of double-stranded DNA sequences from thegenomic DNA; 2) chemical manufacture of a DNA sequence to provide thenecessary codons for the polypeptide of interest; and 3) in vitrosynthesis of a double-stranded DNA sequence by reverse transcription ofmRNA isolated from a eukaryotic donor cell. In the latter case, adouble-stranded DNA complement of mRNA is eventually formed which isgenerally referred to as cDNA.

Of the three above-noted methods for developing specific DNA sequencesfor use in recombinant procedures, the isolation of genomic DNA isolatesis the least common. This is especially true when it is desirable toobtain the microbial expression of mammalian polypeptides due to thepresence of introns.

The synthesis of DNA sequences is frequently the method of choice whenthe entire sequence of amino acid residues of the desired polypeptideproduct is known. When the entire sequence of amino acid residues of thedesired polypeptide is not known, the direct synthesis of DNA sequencesis not possible and the method of choice is the synthesis of cDNAsequences. Among the standard procedures for isolating cDNA sequences ofinterest is the formation of plasmid- or phage-carrying cDNA librarieswhich are derived from reverse transcription of mRNA which is abundantin donor cells that have a high level of genetic expression. When usedin combination with polymerase chain reaction technology, even rareexpression products can be cloned. In those cases where significantportions of the amino acid sequence of the polypeptide are known, theproduction of labeled single or double-stranded DNA or RNA probesequences duplicating a sequence putatively present in the target cDNAmay be employed in DNA/DNA hybridization procedures which are carriedout on cloned copies of the cDNA which have been denatured into asingle-stranded form (Jay, et al., Nucl. Acid Res., 11:2325, 1983).

A cDNA expression library, such as lambda gt11, can be screenedindirectly for GDF-8 peptides having at least one epitope, usingantibodies specific for GDF-8. Such antibodies can be eitherpolyclonally or monoclonally derived and used to detect expressionproduct indicative of the presence of GDF-8 cDNA.

DNA sequences encoding GDF-8 can be expressed in vitro by DNA transferinto a suitable host cell. "Host cells" are cells in which a vector canbe propagated and its DNA expressed. The term also includes any progenyof the subject host cell. It is understood that all progeny may not beidentical to the parental cell since there may be mutations that occurduring replication. However, such progeny are included when the term"host cell" is used.

Methods of stable transfer, meaning that the foreign DNA is continuouslymaintained in the host, are known in the art.

In the present invention, the GDF-8 polynucleotide sequences may beinserted into a recombinant expression vector. The term "recombinantexpression vector" refers to a plasmid, virus or other vehicle known inthe art that has been manipulated by insertion or incorporation of theGDF-8 genetic sequences. Such expression vectors contain a promotersequence which facilitates the efficient transcription of the insertedgenetic sequence of the host. The expression vector typically containsan origin of replication, a promoter, as well as specific genes whichallow phenotypic selection of the transformed cells. Vectors suitablefor use in the present invention include, but are not limited to theT7-based expression vector for expression in bacteria (Rosenberg, etal., Gene, 56:125, 1987), the pMSXND expression vector for expression inmammalian cells (Lee and Nathans, J. Biol. Chem., 263:3521, 1988) andbaculovirus-derived vectors for expression in insect cells. The DNAsegment can be present in the vector operably linked to regulatoryelements, for example, a promoter (e.g., T7, metallothionein I, orpolyhedrin promoters).

Polynucleotide sequences encoding GDF-8 can be expressed in eitherprokaryotes or eukaryotes. Hosts can include microbial, yeast, insectand mammalian organisms. Methods of expressing DNA sequences havingeukaryotic or viral sequences in prokaryotes are well known in the art.Biologically functional viral and plasmid DNA vectors capable ofexpression and replication in a host are known in the art. Such vectorsare used to incorporate DNA sequences of the invention. Preferably, themature C-terminal region of GDF-8 is expressed from a cDNA clonecontaining the entire coding sequence of GDF-8. Alternatively, theC-terminal portion of GDF-8 can be expressed as a fusion protein withthe pro-region of another member of the TGF-β family or co-expressedwith another pro-region (see for example, Hammonds, et al., Molec.Endocrin. 5:149, 1991; Gray, A., and Mason, A., Science, 247:1328,1990).

Transformation of a host cell with recombinant DNA may be carried out byconventional techniques as are well known to those skilled in the art.Where the host is prokaryotic, such as E. coli, competent cells whichare capable of DNA uptake can be prepared from cells harvested afterexponential growth phase and subsequently treated by the CaCl₂ methodusing procedures well known in the art. Alternatively, MgCl₂ or RbCl canbe used. Transformation can also be performed after forming a protoplastof the host cell if desired.

When the host is a eukaryote, such methods of transfection of DNA ascalcium phosphate co-precipitates, conventional mechanical proceduressuch as microinjection, electroporation, insertion of a plasmid encasedin liposomes, or virus vectors may be used. Eukaryotic cells can also becotransformed with DNA sequences encoding the GDF-8 of the invention,and a second foreign DNA molecule encoding a selectable phenotype, suchas the herpes simplex thymidine kinase gene. Another method is to use aeukaryotic viral vector, such as simian virus 40 (SV40) or bovinepapilloma virus, to transiently infect or transform eukaryotic cells andexpress the protein. (see for example, Eukaryotic Viral Vectors, ColdSpring Harbor Laboratory, Gluzman ed., 1982).

Isolation and purification of microbial expressed polypeptide, orfragments thereof, provided by the invention, may be carried out byconventional means including preparative chromatography andimmunological separations involving monoclonal or polyclonal antibodies.

The invention includes antibodies immunoreactive with GDF-8 polypeptideor functional fragments thereof. Antibody which consists essentially ofpooled monoclonal antibodies with differer epitopic specificities, aswell as distinct monoclonal antibody preparations are provided.Monoclonal antibodies are made from antigen containing fragments of the-protein by methods well known to those skilled in the art (Kohler, etal., Nature, 256:495, 1975). The term antibody as used in this inventionis meant to include intact molecules as well as fragments thereof, suchas Fab and F(ab')₂, which are capable of binding an epitopic determinanton GDF-8.

The term "cell-proliferative disorder" denotes malignant as well asnon-malignant cell populations which often appear to differ from thesurrounding tissue both morphologically and genotypically. Malignantcells (i.e. cancer) develop as a result of a multistep process. TheGDF-8 polynucleotide that is an antisense molecule is useful in treatingmalignancies of the various organ systems, particularly, for example,cells in muscle or adipose tissue. Essentially, any disorder which isetiologically linked to altered expression of GDF-8 could be consideredsusceptible to treatment with a GDF-8 suppressing reagent. One suchdisorder is a malignant cell proliferative disorder, for example.

The invention provides a method for detecting a cell proliferativedisorder of muscle or adipose tissue which comprises contacting ananti-GDF-8 antibody with a cell suspected of having a GDF-8 associateddisorder and detecting binding to the antibody. The antibody reactivewith GDF-8 is labeled with a compound which allows detection of bindingto GDF-8. For purposes of the invention, an antibody specific for GDF-8polypeptide may be used to detect the level of GDF-8 in biologicalfluids and tissues. Any specimen containing a detectable amount ofantigen can be used. A preferred sample of this invention is muscletissue. The level of GDF-8 in the suspect cell can be compared with thelevel in a normal cell to determine whether the subject has aGDF-8-associated cell proliferative disorder. Preferably the subject ishuman.

The antibodies of the invention can be used in any subject in which itis desirable to administer in vitro or in vivo immunodiagnosis orimmunotherapy. The antibodies of the invention are suited for use, forexample, in immunoassays in which they can be utilized in liquid phaseor bound to a solid phase carrier. In addition, the antibodies in theseimmunoassays can be detectably labeled in various ways. Examples oftypes of immunoassays which can utilize antibodies of the invention arecompetitive and non-competitive immunoassays in either a direct orindirect format. Examples of such immunoassays are the radioimmunoassay(RIA) and the sandwich (immunometric) assay. Detection of the antigensusing the antibodies of the invention can be done utilizing immunoassayswhich are run in either the forward, reverse, or simultaneous modes,including immunohistochemical assays on physiological samples. Those ofskill in the art will know, or can readily discern, other immunoassayformats without undue experimentation.

The antibodies of the invention can be bound to many different carriersand used to detect the presence of an antigen comprising the polypeptideof the invention. Examples of well-known carriers include glass,polystyrene, polypropylene, polyethylene, dextran, nylon, amylases,natural and modified celluloses, polyacrylamides, agaroses andmagnetite. The nature of the carrier can be either soluble or insolublefor purposes of the invention. Those skilled in the art will know ofother suitable carriers for binding antibodies, or will be able toascertain such, using routine experimentation.

There are many different labels and methods of labeling known to thoseof ordinary skill in the art. Examples of the types of labels which canbe used in the present invention include enzymes, radioisotopes,fluorescent compounds, colloidal metals, chemiluminescent compounds,phosphorescent compounds, and bioluminescent compounds. Those ofordinary skill in the art will know of other suitable labels for bindingto the antibody, or will be able to ascertain such, using routineexperimentation.

Another technique which may also result in greater sensitivity consistsof coupling the antibodies to low molecular weight haptens. Thesehaptens can then be specifically detected by means of a second reaction.For example, it is common to use such haptens as biotin, which reactswith avidin, or dinitrophenyl, puridoxal, and fluorescein, which canreact with specific antihapten antibodies.

In using the monoclonal antibodies of the invention for the in vivodetection of antigen, the detectably labeled antibody is given a dosewhich is diagnostically effective. The term "diagnostically effective"means that the amount of detectably labeled monoclonal antibody isadministered in sufficient quantity to enable detection of the sitehaving the antigen comprising a polypeptide of the invention for whichthe monoclonal antibodies are specific.

The concentration of detectably labeled monoclonal antibody which isadministered should be sufficient such that the binding to those cellshaving the polypeptide is detectable compared to the background.Further, it is desirable that the detectably labeled monoclonal antibodybe rapidly cleared from the circulatory system in order to give the besttarget-to-background signal ratio.

As a rule, the dosage of detectably labeled monoclonal antibody for invivo diagnosis will vary depending on such factors as age, sex, andextent of disease of the individual. Such dosages may vary, for example,depending on whether multiple injections are given, antigenic burden,and other factors known to those of skill in the art.

For in vivo diagnostic imaging, the type of detection instrumentavailable is a major factor in selecting a given radioisotope. Theradioisotope chosen must have a type of decay which is detectable for agiven type of instrument. Still another important factor in selecting aradioisotope for in vivo diagnosis is that deleterious radiation withrespect to the host is minimized. Ideally, a radioisotope used for invivo imaging will lack a particle emission, but produce a large numberof photons in the 140-250 keV range, which may readily be detected byconventional gamma cameras.

For in vivo diagnosis radioisotopes may be bound to immunoglobulineither directly or indirectly by using an intermediate functional group.Intermediate functional groups which often are used to bindradioisotopes which exist as metallic ions to immunoglobulins are thebifunctional chelating agents such as diethylenetriaminepentacetic acid(DTPA) and ethylenediaminetetraacetic acid (EDTA) and similar molecules.Typical examples of metallic ions which can be bound to the monoclonalantibodies of the invention are ¹¹¹ In, ⁹⁷ Ru, ⁶⁷ Ga, ⁶⁸ Ga, ⁷² As, ⁸⁹Zr, and ²⁰¹ TI.

The monoclonal antibodies of the invention can also be labeled with aparamagnetic isotope for purposes of in vivo diagnosis, as in magneticresonance imaging (MRI) or electron spin resonance (ESR). In general,any conventional method for visualizing diagnostic imaging can beutilized. Usually gamma and positron emitting radioisotopes are used forcamera imaging and paramagnetic isotopes for MRI. Elements which areparticularly useful in such techniques include ¹⁵⁷ Gd, ⁵⁵ Mn, ¹⁶² Dy, ⁵²Cr, and ⁵⁶ Fe.

The monoclonal antibodies of the invention can be used in vitro and invivo to monitor the course of amelioration of a GDF-8-associated diseasein a subject. Thus, for example, by measuring the increase or decreasein the number of cells expressing antigen comprising a polypeptide ofthe invention or changes in the concentration of such antigen present invarious body fluids, it would be possible to determine whether aparticular therapeutic regimen aimed at ameliorating theGDF-8-associated disease is effective. The term "ameliorate" denotes alessening of the detrimental effect of the GDF-8-associated disease inthe subject receiving therapy.

The present invention identifies a nucleotide sequence that can beexpressed in an altered manner as compared to expression in a normalcell, therefore it is possible to design appropriate therapeutic ordiagnostic techniques directed to this sequence. Thus, where acell-proliferative disorder is associated with the expression of GDF-8,nucleic acid sequences that interfere with GDF-8 expression at thetranslational level can be used. This approach utilizes, for example,antisense nucleic acid and ribozymes to block translation of a specificGDF-8 mRNA, either by masking that mRNA with an antisense nucleic acidor by cleaving it with a ribozyme. Such disorders includeneurodegenerative diseases, for example.

Antisense nucleic acids are DNA or RNA molecules that are complementaryto at least a portion of a specific mRNA molecule (Weintraub, ScientificAmerican, 262:40, 1990). In the cell, the antisense nucleic acidshybridize to the corresponding mRNA, forming a double-stranded molecule.The antisense nucleic acids interfere with the translation of the mRNA,since the cell will not translate a mRNA that is double-stranded.Antisense oligomers of about 15 nucleotides are preferred, since theyare easily synthesized and are less likely to cause problems than largermolecules when introduced into the target GDF-8-producing cell. The useof antisense methods to inhibit the in vitro translation of genes iswell known in the art (Marcus-Sakura, Anal. Biochem., 172:289, 1988).

Ribozymes are RNA molecules possessing the ability to specificallycleave other single-stranded RNA in a manner analogous to DNArestriction endonucleases. Through the modification of nucleotidesequences which encode these RNAs, it is possible to engineer moleculesthat recognize specific nucleotide sequences in an RNA molecule andcleave it (Cech, J. Amer. Med. Assn., 260:3030, 1988). A major advantageof this approach is that, because they are sequence-specific, only mRNAswith particular sequences are inactivated.

There are two basic types of ribozymes namely, tetrahymena-type(Hasselhoff, Nature, 334:585, 1988) and "hammerhead"-type.Tetrahymena-type ribozymes recognize sequences which are four bases inlength, while "hammerhead"-type ribozymes recognize base sequences 11-18bases in length. The longer the recognition sequence, the greater thelikelihood that the sequence will occur exclusively in the target mRNAspecies. Consequently, hammerhead-type ribozymes are preferable totetrahymena-type ribozymes for inactivating a specific mRNA species and18-based recognition sequences are preferable to shorter recognitionsequences.

The present invention also provides gene therapy for the treatment ofcell proliferative or immunologic disorders which are mediated by GDF-8protein. Such therapy would achieve its therapeutic effect byintroduction of the GDF-8 antisense polynucleotide into cells having theproliferative disorder. Delivery of antisense GDF-8 polynucleotide canbe achieved using a recombinant expression vector such as a chimericvirus or a colloidal dispersion system. Especially preferred fortherapeutic delivery of antisense sequences is the use of targetedliposomes.

Various viral vectors which can be utilized for gene therapy as taughtherein include adenovirus, herpes virus, vaccinia, or, preferably, anRNA virus such as a retrovirus. Preferably, the retroviral vector is aderivative of a murine or avian retrovirus. Examples of retroviralvectors in which a single foreign gene can be inserted include, but arenot limited to: Moloney murine leukemia virus (MoMuLV), Harvey murinesarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), and RousSarcoma Virus (RSV). A number of additional retroviral vectors canincorporate multiple genes. All of these vectors can transfer orincorporate a gene for a selectable marker so that transduced cells canbe identified and generated. By inserting a GDF-8 sequence of interestinto the viral vector, along with another gene which encodes the ligandfor a receptor on a specific target cell, for example, the vector is nowtarget specific. Retroviral vectors can be made target specific byattaching, for example, a sugar, a glycolipid, or a protein. Preferredtargeting is accomplished by using an antibody to target the retroviralvector. Those of skill in the art will know of, or can readily ascertainwithout undue experimentation, specific polynucleotide sequences whichcan be inserted into the retroviral genome or attached to a viralenvelope to allow target specific delivery of the retroviral vectorcontaining the GDF-8 antisense polynucleotide.

Since recombinant retroviruses are defective, they require assistance inorder to produce infectious vector particles. This assistance can beprovided, for example, by using helper cell lines that contain plasmidsencoding all of the structural genes of the retrovirus under the controlof regulatory sequences within the LTR. These plasmids are missing anucleotide sequence which enables the packaging mechanism to recognizean RNA transcript for encapsidation. Helper cell lines which havedeletions of the packaging signal include, but are not limited to Ψ2,PA317 and PA12, for example. These cell lines produce empty virions,since no genome is packaged. If a retroviral vector is introduced intosuch cells in which the packaging signal is intact, but the structuralgenes are replaced by other genes of interest, the vector can bepackaged and vector virion produced.

Alternatively, NIH 3T3 or other tissue culture cells can be directlytransfected with plasmids encoding the retroviral structural genes gag,pol and env, by conventional calcium phosphate transfection. These cellsare then transfected with the vector plasmid containing the genes ofinterest. The resulting cells release the retroviral vector into theculture medium.

Another targeted delivery system for GDF-8 antisense polynucleotides isa colloidal dispersion system. Colloidal dispersion systems includemacromolecule complexes, nanocapsules, microspheres, beads, andlipid-based systems including oil-in-water emulsions, micelles, mixedmicelles, and liposomes. The preferred colloidal system of thisinvention is a liposome. Liposomes are artificial membrane vesicleswhich are useful as delivery vehicles in vitro and in vivo. It has beenshown that large unilamellar vesicles (LUV), which range in size from0.2-4.0 μm can encapsulate a substantial percentage of an aqueous buffercontaining large macromolecules. RNA, DNA and intact virions can beencapsulated within the aqueous interior and be delivered to cells in abiologically active form (Fraley, et al., Trends Biochem. Sci., 6:77,1981). In addition to mammalian cells, liposomes have been used fordelivery of polynucleotides in plant, yeast and bacterial cells. Inorder for a liposome to be an efficient gene transfer vehicle, thefollowing characteristics should be present: (1) encapsulation of thegenes of interest at high efficiency while not compromising theirbiological activity; (2) preferential and substantial binding to atarget cell in comparison to non-target cells; (3) delivery of theaqueous contents of the vesicle to the target cell cytoplasm at highefficiency; and (4) accurate and effective expression of geneticinformation (Mannino, et al., Biotechniques, 6:682, 1988).

The composition of the liposome is usually a combination ofphospholipids, particularly high-phase-transition-temperaturephospholipids, usually in combination with steroids, especiallycholesterol. Other phospholipids or other lipids may also be used. Thephysical characteristics of liposomes depend on pH, ionic strength, andthe presence of divalent cations.

Examples of lipids useful in liposome production include phosphatidylcompounds, such as phosphatidylglycerol, phosphatidylcholine,phosphatidylserine, phosphatidylethanolamine, sphingolipids,cerebrosides, and gangliosides. Particularly useful arediacylphosphatidylglycerols, where the lipid moiety contains from 14-18carbon atoms, particularly from 16-18 carbon atoms, and is saturated.Illustrative phospholipids include egg phosphatidylcholine,dipalmitoylphosphatidylcholine and distearoylphosphatidylcholine.

The targeting of liposomes can be classified based on anatomical andmechanistic factors. Anatomical classification is based on the level ofselectivity, for example, organ-specific, cell-specific, andorganelle-specific. Mechanistic targeting can be distinguished basedupon whether it is passive or active. Passive targeting utilizes thenatural tendency of liposomes to distribute to cells of thereticulo-endothelial system (RES) in organs which contain sinusoidalcapillaries. Active targeting, on the other hand, involves alteration ofthe liposome by coupling the liposome to a specific ligand such as amonoclonal antibody, sugar, glycolipid, or protein, or by changing thecomposition or size of the liposome in order to achieve targeting toorgans and cell types other than the naturally occurring sites oflocalization.

The surface of the targeted delivery system may be modified in a varietyof ways. In the case of a liposomal targeted delivery system, lipidgroups can be incorporated into the lipid bilayer of the liposome inorder to maintain the targeting ligand in stable association with theliposomal bilayer. Various linking groups can be used for joining thelipid chains to the targeting ligand.

Due to the expression of GDF-8 in muscle and adipose tissue, there are avariety of applications using the polypeptide, polynucleotide, andantibodies of the invention, related to these tissues. Such applicationsinclude treatment of cell proliferative disorders involving these andother tissues, such as neural tissue. In addition, GDF-8 may be usefulin various gene therapy procedures.

The data in Example 6 shows that the human GDF-8 gene is located onchromosome 2. By comparing the chromosomal location of GDF-8 with themap positions of various human disorders, it should be possible todetermine whether mutations in the GDF-8 gene are involved in theetiology of human diseases. For example, an autosomal recessive form ofjuvenile amyotrophic lateral sclerosis has been shown to map tochromosome 2 (Hentati, et al., Neurology, 42 Suppl.3!:201, 1992). Moreprecise mapping of GDF-8 and analysis of DNA from these patients mayindicate that GDF-8 is, in fact, the gene affected in this disease. Inaddition, GDF-8 is useful for distinguishing chromosome 2 from otherchromosomes.

The following examples are intended to illustrate but not limit theinvention. While they are typical of those that might be used, otherprocedures known to those skilled in the art may alternatively be used.

EXAMPLE 1 IDENTIFICATION AND ISOLATION OF A NOVEL TGF-β FAMILY MEMBER

To identify a new member of the TGF-β superfamily, degenerateoligonucleotides were designed which corresponded to two conservedregions among the known family members: one region spanning the twotryptophan residues conserved in all family members except MIS and theother region spanning the invariant cysteine residues near theC-terminus. These primers were used for polymerase chain reactions onmouse genomic DNA followed by subcloning the PCR products usingrestriction sites placed at the 5' ends of the primers, pickingindividual E. coli colonies carrying these subcloned inserts, and usinga combination of random sequencing and hybridization analysis toeliminate known members of the superfamily.

GDF-8 was identified from a mixture of PCR products obtained with theprimers ##STR1## PCR products of approximately 2 μg mouse genomic DNA at94° C. for 1 min, 50° C. for 2 min, and 72° C. for 2 min for 40 cycles.

PCR products of approximately 280 bp were gel-purified, digested withEco RI, gel-purified again, and subcloned in the Bluescript vector(Stratagene, San Diego, Calif.). Bacterial colonies carrying individualsubclones were picked into 96 well microtiter plates, and multiplereplicas were prepared by plating the cells onto nitrocellulose. Thereplicate filters were hybridized to probes representing known membersof the family, and DNA was prepared from non-hybridzing colonies forsequence analysis.

The primer combination of SJL141 and SJL147, encoding the amino acidsequences GW(H/Q/N/K/D/E)(D/N)W(V/I/M)(V/I/M)(A/S)P (SEQ ID NO:9) andM(V/I/M/T/A)V(D/E)SC(G/A)C (SEQ ID NO:10), respectively, yielded fourpreviously identified sequences (BMP-4, inhibin βB, GDF-3 and GDF-5) andone novel sequence, which was designated GDF-8, among 110 subclonesanalyzed.

Human GDF-8 was isolated using the primers: ##STR2## PCR using theseprimers was carried out with one μg human genomic DNA at 94° C. for 1min, 58° C. for 2 min, and 72° C. for 2 min for 30 cycles. The PCRproduct was digested with Bam HI, gel-purified, and subcloned in theBluescript vector (Stratagene, San Francisco, Calif.).

EXAMPLE 2 EXPRESSION PATTERN AND SEQUENCE OF GDF-8

To determine the expression pattern of GDF-8, RNA samples prepared froma variety of adult tissues were screened by Northern analysis. RNAisolation and Northern analysis were carried out as described previously(Lee, S.-J., Mol. Endocrinol., 4:1034, 1990) except that hybridizationwas carried out in 5× SSPE, 10% dextran sulfate, 50% formamide, 1% SDS,200 μg/ml salmon DNA, and 0.1% each of bovine serum albumin, ficoll, andpolyvinylpyrrolidone. Five micrograms of twice poly A-selected RNAprepared from each tissue (except for muscle, for which only 2 μg RNAwas used) were electrophoresed on formaldehyde gels, blotted, and probedwith GDF-8. As shown in FIG. 1, the GDF-8 probe detected a single mRNAspecies expressed at highest levels in muscle and at significantly lowerlevels in adipose tissue.

To obtain a larger segment of the GDF-8 gene, a mouse genomic librarywas screened with a probe derived from the GDF-8 PCR product. Thepartial sequence of a GDF-8 genomic clone is shown in FIG. 2a. Thesequence contains an open reading frame corresponding to the predictedC-terminal region of the GDF-8 precursor protein. The predicted GDF-8sequence contains two potential proteolytic processing sites, which areboxed. Cleavage of the precursor at the second of these sites wouldgenerate a mature C-terminal fragment 109 amino acids in length with apredicted molecular weight of 12,400. The partial sequence of humanGDF-8 is shown in FIG. 2b. Assuming no PCR-induced errors during theisolation of the human clone, the human and mouse amino acid sequencesin this region are 100% identical.

The C-terminal region of GDF-8 following the putative proteolyticprocessing site shows significant homology to the known members of theTGF-β superfamily (FIG. 3). FIG. 3 shows the alignment of the C-terminalsequences of GDF-8 with the corresponding regions of human GDF-1 (Lee,Proc. Natl. Acad. Sci. U.S.A., 88:4250-4254, 1991), human BMP-2 and 4(Wozney, et al., Science, 242:1528-1534, 1988), human Vgr-1 (Celeste, etal., Proc. Natl. Acad. Sci. U.S.A., 87:9843-9847, 1990), human OP-1(Ozkaynak, et al., EMBO J., 9:2085-2093, 1990), human BMP-5 (Celeste, etal., Proc. Natl. Acad. Sci. U.S.A., 87:9843-9847, 1990), human BMP-3(Wozney, et al., Science, 242:1528-1534,1988), human MIS (Cate, et al.,Cell, 45:685-698, 1986), human inhibin alpha, βA, and βB (Mason, et al.,Biochem, Biophys. Res. Commun., 135:957-964, 1986), human TGF-:1(Derynck, et al., Nature, 316:701-705, 1985), humanTGF-β2 (deMartin, etal., EMBO J., 6:3673-3677, 1987), and human TGF-β3 (ten Dijke, et al.,Proc. Natl. Acad. Sci. U.S.A., 85:4715-4719, 1988). The conservedcysteine residues are boxed. Dashes denote gaps introduced in order tomaximize the alignment.

GDF-8 contains most of the residues that are highly conserved in otherfamily members, including the seven cysteine residues with theircharacteristic spacing. Like the TGF-βs and inhibin βs, GDF-8 alsocontains two additional cysteine residues. In the case of TGF-β2, thesetwo additional cysteine residues are known to form an intramoleculardisulfide bond (Daopin, et al., Science, 257:369, 1992; Schlunegger andGrutter, Nature, 358:430, 1992).

FIG. 4 shows the amino acid homologies among the different members ofthe TGF-β superfamily. Numbers represent percent amino acid identitiesbetween each pair calculated from the first conserved cysteine to theC-terminus. Boxes represent homologies among highly-related memberswithin particular subgroups. In this region, GDF-8 is most homologous toVgr-1 (45% sequence identity).

EXAMPLE 3 ISOLATION OF cDNA CLONES ENCODING MURINE AND HUMAN GDF-8

In order to isolate full-length cDNA clones encoding murine and humanGDF-8, cDNA libraries were prepared in the lambda ZAP II vector(Stratagene) using RNA prepared from skeletal muscle. From 5 μg of twicepoly A-selected RNA prepared from murine and human muscle, cDNAlibraries consisting of 4.4 million and 1.9 million recombinant phage,respectively, were constructed according to the instructions provided byStratagene. These libraries were screened without amplification. Libraryscreening and characterization of cDNA inserts were carried out asdescribed previously (Lee, Mol. Endocrinol, 4:1034-1040).

From 2.4×10⁶ recombinant phage screened from the murine muscle cDNAlibrary, greater than 280 positive phage were identified using a murineGDF-8 probe derived from a genomic clone, as described in Example 1. Theentire nucleotide sequence of the longest cDNA insert analyzed is shownin FIG. 5a and SEQ ID NO:11. The 2676 base pair sequence contains asingle long open reading frame beginning with a methionine codon atnucleotide 104 and extending to a TGA stop codon at nucleotide 1232.Upstream of the putative initiating methionine codon is an in-frame stopcodon at nucleotide 23. The predicted pre-pro-GDF-8 protein is 376 aminoacids in length. The sequence contains a core of hydrophobic amino acidsat the N-terminus suggestive of a signal peptide for secretion (FIG.6a), one potential N-glycosylation site at asparagine 72, a putativeRXXR (SEQ ID NO:32) proteolytic cleavage site at amino acids 264-267,and a C-terminal region showing significant homology to the knownmembers of the TGF-β superfamily. Cleavage of the precursor protein atthe putative RXXR (SEQ ID NO:32) site would generate a mature C-terminalGDF-8 fragment 109 amino acids in length with a predicted molecularweight of approximately 12,400.

From 1.9×10⁶ recombinant phage screened from the human muscle cDNAlibrary, 4 positive phage were identified using a human GDF-8 probederived by polymerase chain reaction on human genomic DNA. The entirenucleotide sequence of the longest cDNA insert is shown in FIG. 5b andSEQ ID NO:13. The 2743 base pair sequence contains a single long openreading frame beginning with a methionine codon at nucleotide 59 andextending to a TGA stop codon at nucleotide 1184. The predictedpre-pro-GDF-8 protein is 375 amino acids in length. The sequencecontains a core of hydrophobic amino acids at the N-terminus suggestiveof a signal peptide for secretion (FIG. 6b), one potentialN-glycosylation site at asparagine 71, and a putative RXXR (SEQ IDNO:32) proteolytic cleavage site at amino acids 263-266. FIG. 7 shows acomparison of the predicted murine (top) and human (bottom) GDF-8 aminoacid sequences. Numbers indicate amino acid position relative to theN-terminus. Identities between the two sequences are denoted by avertical line. Murine and human GDF-8 are approximately 94% identical inthe predicted pro-regions and 100% identical following the predictedRXXR (SEQ ID NO:32) cleavage sites.

EXAMPLE 4 PREPARATION OF ANTIBODIES AGAINST GDF-8 AND EXPRESSION OFGDF-8 IN MAMMALIAN CELLS

In order to prepare antibodies against GDF-8, GDF-8 antigen wasexpressed as a fusion protein in bacteria. A portion of murine GDF-8cDNA spanning amino acids 268-376 (mature region) was inserted into thepRSET vector (Invitrogen) such that the GDF-8 coding sequence was placedin frame with the initiating methionine codon present in the vector; theresulting construct created an open reading frame encoding a fusionprotein with a molecular weight of approximately 16,600. The fusionconstruct was transformed into BL21 (DE3) (pLysS) cells, and expressionof the fusion protein was induced by treatment withisopropylthio-β-galactoside as described (Rosenberg, et al., Gene,56:125-135). The fusion protein was then purified by metal chelatechromatography according to the instructions provided by Invitrogen. ACoomassie blue-stained gel of unpurified and purified fusion proteins isshown in FIG. 8.

The purified fusion protein was used to immunize both rabbits andchickens. Immunization of rabbits was carried out by Spring Valley Labs(Sykesville, Md.), and immunization of chickens was carried out by HRP,Inc. (Denver, Pa.). Western analysis of sera both from immunized rabbitsand from immunized chickens demonstrated the presence of antibodiesdirected against the fusion protein.

To express GDF-8 in mammalian cells, the murine GDF-8 cDNA sequence fromnucleotides 48-1303 was cloned in both orientations downstream of themetallothionein I promoter in the pMSXND expression vector; this vectorcontains processing signals derived from SV40, a dihydrofolate reductasegene, and a gene conferring resistance to the antibiotic G418 (Lee andNathans, J. Biol. Chem., 263:3521-3527). The resulting constructs weretransfected into Chinese hamster ovary cells, and stable tranfectantswere selected in the presence of G418. Two milliliters of conditionedmedia prepared from the G418-resistant cells were dialyzed, lyophilized,electrophoresed under denaturing, reducing conditions, transferred tonitrocellulose, and incubated with anti-GDF-8 antibodies (describedabove) and ¹²⁵ I!iodoproteinA.

As shown in FIG. 9, the rabbit GDF-8 antibodies (at a 1:500 dilution)detected a protein of approximately the predicted molecular weight forthe mature C-terminal fragment of GDF-8 in the conditioned media ofcells transfected with a construct in which GDF-8 had been cloned in thecorrect (sense) orientation with respect to the metallothionein promoter(lane 2); this band was not detected in a similar sample prepared fromcells transfected with a control antisense construct (lane 1). Similarresults were obtained using antibodies prepared in chickens. Hence,GDF-8 is secreted and proteolytically processed by these transfectedmammalian cells.

EXAMPLE 5 EXPRESSION PATTERN OF GDF-8

To determine the pattern of GDF-8, 5 μg of twice poly A-selected RNAprepared from a variety of murine tissue sources were subjected toNorthern analysis. As shown in FIG. 10a (and as shown previously inExample 2), the GDF-8 probe detected a single mRNA species presentalmost exclusively in skeletal muscle among a large number of adulttissues surveyed. On longer exposures of the same blot, significantlylower but detectable levels of GDF-8 mRNA were seen in fat, brain,thymus, heart, and lung. Hence, these results confirm the high degree ofspecificity of GDF-8 expression in skeletal muscle. GDF-8 mRNA was alsodetected in mouse embryos at both gestational ages (day 12.5 and day18.5 post-coital) examined but not in placentas at various stages ofdevelopment (FIG. 10b).

EXAMPLE 6 CHROMOSOMAL LOCALIZATION OF GDF-8

In order to map the chromosomal location of GDF-8, DNA samples fromhuman/rodent somatic cell hybrids (Drwinga, et al., Genomics,16:311-413, 1993; Dubois and Naylor, Genomics, 16:315-319, 1993) wereanalyzed by polymerase chain reaction followed by Southern blotting.Polymerase chain reaction was carried out using primer #83,5'-CGCGGATCCGTGGATCTAAATGAGAACAGTGAGC-3'(SEQ ID NO:15) and primer #84,5'-CGCGAATTCTCAGGTAATGATTGTTTCCGTTGTAGCG-3'(SEQ ID NO:16) for 40 cyclesat 94° C. for 2 minutes, 60° C. for 1 minute, and 72° C. for 2 minutes.These primers correspond to nucleotides 119 to 143 (flanked by a Bam H1recognition sequence), and nucleotides 394 to 418 (flanked by an Eco R1recognition sequence), respectively, in the human GDF-8 cDNA sequence.PCR products were electrophoresed on agarose gels, blotted, and probedwith oligonucleotide #100, 5'-ACACTAAATCTTCAAGAATA-3'(SEQ ID NO:17),which corresponds to a sequence internal to the region flanked by primer#83 and #84. Filters were hybridized in 6×SSC, 1×Denhardt's solution,100, μg/ml yeast transfer RNA, and 0.05% sodium pyrophosphate at 50° C.

As shown in FIG. 11, the human-specific probe detected a band of thepredicted size (approximately 320 base pairs) in the positive controlsample (total human genomic DNA) and in a single DNA sample from thehuman/rodent hybrid panel. This positive signal corresponds to humanchromosome 2. The human chromosome contained in each of the hybrid celllines is identified at the top of each of the first 24 lanes (1-22, X,and Y). In the lanes designated M, CHO, and H, the starting DNA templatewas total genomic DNA from mouse, hamster, and human sources,respectively. In the lane marked B1, no template DNA was used. Numbersat left indicate the mobilities of DNA standards. These data show thatthe human GDF-8 gene is located on chromosome 2.

Although the invention has been described with reference to thepresently preferred embodiment, it should be understood that variousmodifications can be made without departing from the spirit of theinvention. Accordingly, the invention is limited only by the followingclaims.

SUMMARY OF SEQUENCES

SEQ ID NO:1 is the nucleic acid sequence for clone SJL141.

SEQ ID NO:2 is the nucleic acid sequence for clone SJL147.

SEQ ID NO:3 is the nucleic acid sequence for clone ACM13.

SEQ ID NO:4 is the nucleic acid sequence for clone ACM14.

SEQ ID NO:5 is the partial nucleotide sequence and deduced amino acidsequence for murine GDF-8.

SEQ ID NO:6 is the deduced partial amino acid sequence for murine GDF-8.

SEQ ID NO:7 is the partial nucleotide sequence and deduced amino acidsequence for human GDF-8.

SEQ ID NO:8 is the deduced partial amino acid sequence for human GDF-8.

SEQ ID NO:9 is the amino acid sequence for primer SJL141.

SEQ ID NO:10 is the amino acid sequence for primer SJL147.

SEQ ID NO:11 is the nucleotide and deduced amino acid sequence formurine GDF-8.

SEQ ID NO:12 is the deduced amino acid sequence for murine GDF-8.

SEQ ID NO:13 is the nucleotide and deduced amino acid sequence for humanGDF-8.

SEQ ID NO:14 is the deduced amino acid sequence for human GDF-8.

SEQ ID NO's:15 and 16 are nucleotide sequences for primer #83 and #84,respectively, which were used to map human GDF-8 in human/rodent somaticcell hybrids.

SEQ ID NO:17 is the nucleotide sequence of oligonucleotide #100 whichcorresponds to a sequence internal to the region flanked by primer #83and #84.

    __________________________________________________________________________    SEQUENCE LISTING                                                              (1) GENERAL INFORMATION:                                                      (iii) NUMBER OF SEQUENCES: 32                                                 (2) INFORMATION FOR SEQ ID NO:1:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 35 base pairs                                                     (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: SJL141                                                             (ix) FEATURE:                                                                 (A) NAME/KEY: Modified Base                                                   (B) LOCATION: 1...35                                                          (D) OTHER INFORMATION: /note= "N=inosine"                                     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:                                       CCGGAATTCGGNTGGVANRAYTGGRTNRTNKCNCC35                                         (2) INFORMATION FOR SEQ ID NO:2:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 33 base pairs                                                     (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: SJL147                                                             (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 1...33                                                          (C) OTHER INFORMATION: /note= "N-inosine"                                     (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:                                       CCGGAATTCRCANSCRCARCTNTCNACNRYCAT33                                           (2) INFORMATION FOR SEQ ID NO:3:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 32 base pairs                                                     (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: ACM13                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 1...32                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:                                       CGCGGATCCAGAAGTCAAGGTGACAGACACAC32                                            (2) INFORMATION FOR SEQ ID NO:4:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 33 base pairs                                                     (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: ACM14                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 1...33                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:                                       CGCGGATCCTCCTCATGAGCACCCACAGCGGTC33                                           (2) INFORMATION FOR SEQ ID NO:5:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 550 base pairs                                                    (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: mouse GDF-8                                                        (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 59...436                                                        (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:                                       TTAAGGTAGGAAGGATTTCAGGCTCTATTTACATAATTGTTCTTTCCTTTTCACACAG58                  AATCCCTTTTTAGAAGTCAAGGTGACAGACACACCCAAGAGGTCCCGG106                           AsnProPheLeuGluValLysValThrAspThrProLysArgSerArg                              151015                                                                        AGAGACTTTGGGCTTGACTGCGATGAGCACTCCACGGAATCCCGGTGC154                           ArgAspPheGlyLeuAspCysAspGluHisSerThrGluSerArgCys                              202530                                                                        TGCCGCTACCCCCTCACGGTCGATTTTGAAGCCTTTGGATGGGACTGG202                           CysArgTyrProLeuThrValAspPheGluAlaPheGlyTrpAspTrp                              354045                                                                        ATTATCGCACCCAAAAGATATAAGGCCAATTACTGCTCAGGAGAGTGT250                           IleIleAlaProLysArgTyrLysAlaAsnTyrCysSerGlyGluCys                              505560                                                                        GAATTTGTGTTTTTACAAAAATATCCGCATACTCATCTTGTGCACCAA298                           GluPheValPheLeuGlnLysTyrProHisThrHisLeuValHisGln                              65707580                                                                      GCAAACCCCAGAGGCTCAGCAGGCCCTTGCTGCACTCCGACAAAAATG346                           AlaAsnProArgGlySerAlaGlyProCysCysThrProThrLysMet                              859095                                                                        TCTCCCATTAATATGCTATATTTTAATGGCAAAGAACAAATAATATAT394                           SerProIleAsnMetLeuTyrPheAsnGlyLysGluGlnIleIleTyr                              100105110                                                                     GGGAAAATTCCAGCCATGGTAGTAGACCGCTGTGGGTGCTCATGAGCTTTGC446                       GlyLysIleProAlaMetValValAspArgCysGlyCysSer                                    115120125                                                                     ATTAGGTTAGAAACTTCCCAAGTCATGGAAGGTCTTCCCCTCAATTTCGAAACTGTGAAT506               TCCTGCAGCCCGGGGGATCCACTAGTTCTAGAGCGGCCGCCACC550                               (2) INFORMATION FOR SEQ ID NO:6:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 126 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (v) FRAGMENT TYPE: internal                                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:                                       AsnProPheLeuGluValLysValThrAspThrProLysArgSerArg                              151015                                                                        ArgAspPheGlyLeuAspCysAspGluHisSerThrGluSerArgCys                              202530                                                                        CysArgTyrProLeuThrValAspPheGluAlaPheGlyTrpAspTrp                              354045                                                                        IleIleAlaProLysArgTyrLysAlaAsnTyrCysSerGlyGluCys                              505560                                                                        GluPheValPheLeuGlnLysTyrProHisThrHisLeuValHisGln                              65707580                                                                      AlaAsnProArgGlySerAlaGlyProCysCysThrProThrLysMet                              859095                                                                        SerProIleAsnMetLeuTyrPheAsnGlyLysGluGlnIleIleTyr                              100105110                                                                     GlyLysIleProAlaMetValValAspArgCysGlyCysSer                                    115120125                                                                     (2) INFORMATION FOR SEQ ID NO:7:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 326 base pairs                                                    (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: human GDF-8                                                        (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 3...326                                                         (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:                                       CAAAAAGATCCAGAAGGGATTTTGGTCTTGACTGTGATGAGCACTCA47                             LysArgSerArgArgAspPheGlyLeuAspCysAspGluHisSer                                 151015                                                                        ACAGAATCACGATGCTGTCGTTACCCTCTAACTGTGGATTTTGAAGCT95                            ThrGluSerArgCysCysArgTyrProLeuThrValAspPheGluAla                              202530                                                                        TTTGGATGGGATTGGATTATCGCTCCTAAAAGATATAAGGCCAATTAC143                           PheGlyTrpAspTrpIleIleAlaProLysArgTyrLysAlaAsnTyr                              354045                                                                        TGCTCTGGAGAGTGTGAATTTGTATTTTTACAAAAATATCCTCATACT191                           CysSerGlyGluCysGluPheValPheLeuGlnLysTyrProHisThr                              505560                                                                        CATCTGGTACACCAAGCAAACCCCAGAGGTTCAGCAGGCCCTTGCTGT239                           HisLeuValHisGlnAlaAsnProArgGlySerAlaGlyProCysCys                              657075                                                                        ACTCCCACAAAGATGTCTCCAATTAATATGCTATATTTTAATGGCAAA287                           ThrProThrLysMetSerProIleAsnMetLeuTyrPheAsnGlyLys                              80859095                                                                      GAACAAATAATATATGGGAAAATTCCAGCGATGGTAGTA326                                    GluGlnIleIleTyrGlyLysIleProAlaMetValVal                                       100105                                                                        (2) INFORMATION FOR SEQ ID NO:8:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 108 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (v) FRAGMENT TYPE: internal                                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:                                       LysArgSerArgArgAspPheGlyLeuAspCysAspGluHisSerThr                              151015                                                                        GluSerArgCysCysArgTyrProLeuThrValAspPheGluAlaPhe                              202530                                                                        GlyTrpAspTrpIleIleAlaProLysArgTyrLysAlaAsnTyrCys                              354045                                                                        SerGlyGluCysGluPheValPheLeuGlnLysTyrProHisThrHis                              505560                                                                        LeuValHisGlnAlaAsnProArgGlySerAlaGlyProCysCysThr                              65707580                                                                      ProThrLysMetSerProIleAsnMetLeuTyrPheAsnGlyLysGlu                              859095                                                                        GlnIleIleTyrGlyLysIleProAlaMetValVal                                          100105                                                                        (2) INFORMATION FOR SEQ ID NO:9:                                              (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 9 amino acids                                                     (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: peptide                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: SJL141                                                             (ix) FEATURE:                                                                 (A) NAME/KEY: Peptide                                                         (B) LOCATION: 1...9                                                           (D) OTHER INFORMATION: /note= "Xaa at position 3=His, Gln, Asn                Lys, Asp or Glu; Xaa at position 4=Asp or Asn; Xaa at                         positions 6 and 7=Val, Ile or Met; Ala = Xaa at position                      8=Ala or Ser"                                                                 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9:                                       GlyTrpXaaXaaTrpXaaXaaXaaPro                                                   15                                                                            (2) INFORMATION FOR SEQ ID NO:10:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 8 amino acids                                                     (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: peptide                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: SJL147                                                             (ix) FEATURE:                                                                 (A) NAME/KEY: Peptide                                                         (B) LOCATION: 1...8                                                           (D) OTHER INFORMATION: /note= "Xaa at position 2=Ile, Val, Met                Thr or Ala; Xaa at position 4=Asp or Glu; Xaa at position                     7=Gly or Ala"                                                                 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:10:                                      MetXaaValXaaSerCysXaaCys                                                      15                                                                            (2) INFORMATION FOR SEQ ID NO:11:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 2676 base pairs                                                   (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Murine GDF-8                                                       (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 104...1231                                                      (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:                                      GTCTCTCGGACGGTACATGCACTAATATTTCACTTGGCATTACTCAAAAGCAAAAAGAAG60                AAATAAGAACAAGGGAAAAAAAAAGATTGTGCTGATTTTTAAAATGATGCAAAAA115                    MetMetGlnLys                                                                  CTGCAAATGTATGTTTATATTTACCTGTTCATGCTGATTGCTGCTGGC163                           LeuGlnMetTyrValTyrIleTyrLeuPheMetLeuIleAlaAlaGly                              5101520                                                                       CCAGTGGATCTAAATGAGGGCAGTGAGAGAGAAGAAAATGTGGAAAAA211                           ProValAspLeuAsnGluGlySerGluArgGluGluAsnValGluLys                              253035                                                                        GAGGGGCTGTGTAATGCATGTGCGTGGAGACAAAACACGAGGTACTCC259                           GluGlyLeuCysAsnAlaCysAlaTrpArgGlnAsnThrArgTyrSer                              404550                                                                        AGAATAGAAGCCATAAAAATTCAAATCCTCAGTAAGCTGCGCCTGGAA307                           ArgIleGluAlaIleLysIleGlnIleLeuSerLysLeuArgLeuGlu                              556065                                                                        ACAGCTCCTAACATCAGCAAAGATGCTATAAGACAACTTCTGCCAAGA355                           ThrAlaProAsnIleSerLysAspAlaIleArgGlnLeuLeuProArg                              707580                                                                        GCGCCTCCACTCCGGGAACTGATCGATCAGTACGACGTCCAGAGGGAT403                           AlaProProLeuArgGluLeuIleAspGlnTyrAspValGlnArgAsp                              859095100                                                                     GACAGCAGTGATGGCTCTTTGGAAGATGACGATTATCACGCTACCACG451                           AspSerSerAspGlySerLeuGluAspAspAspTyrHisAlaThrThr                              105110115                                                                     GAAACAATCATTACCATGCCTACAGAGTCTGACTTTCTAATGCAAGCG499                           GluThrIleIleThrMetProThrGluSerAspPheLeuMetGlnAla                              120125130                                                                     GATGGCAAGCCCAAATGTTGCTTTTTTAAATTTAGCTCTAAAATACAG547                           AspGlyLysProLysCysCysPhePheLysPheSerSerLysIleGln                              135140145                                                                     TACAACAAAGTAGTAAAAGCCCAACTGTGGATATATCTCAGACCCGTC595                           TyrAsnLysValValLysAlaGlnLeuTrpIleTyrLeuArgProVal                              150155160                                                                     AAGACTCCTACAACAGTGTTTGTGCAAATCCTGAGACTCATCAAACCC643                           LysThrProThrThrValPheValGlnIleLeuArgLeuIleLysPro                              165170175180                                                                  ATGAAAGACGGTACAAGGTATACTGGAATCCGATCTCTGAAACTTGAC691                           MetLysAspGlyThrArgTyrThrGlyIleArgSerLeuLysLeuAsp                              185190195                                                                     ATGAGCCCAGGCACTGGTATTTGGCAGAGTATTGATGTGAAGACAGTG739                           MetSerProGlyThrGlyIleTrpGlnSerIleAspValLysThrVal                              200205210                                                                     TTGCAAAATTGGCTCAAACAGCCTGAATCCAACTTAGGCATTGAAATC787                           LeuGlnAsnTrpLeuLysGlnProGluSerAsnLeuGlyIleGluIle                              215220225                                                                     AAAGCTTTGGATGAGAATGGCCATGATCTTGCTGTAACCTTCCCAGGA835                           LysAlaLeuAspGluAsnGlyHisAspLeuAlaValThrPheProGly                              230235240                                                                     CCAGGAGAAGATGGGCTGAATCCCTTTTTAGAAGTCAAGGTGACAGAC883                           ProGlyGluAspGlyLeuAsnProPheLeuGluValLysValThrAsp                              245250255260                                                                  ACACCCAAGAGGTCCCGGAGAGACTTTGGGCTTGACTGCGATGAGCAC931                           ThrProLysArgSerArgArgAspPheGlyLeuAspCysAspGluHis                              265270275                                                                     TCCACGGAATCCCGGTGCTGCCGCTACCCCCTCACGGTCGATTTTGAA979                           SerThrGluSerArgCysCysArgTyrProLeuThrValAspPheGlu                              280285290                                                                     GCCTTTGGATGGGACTGGATTATCGCACCCAAAAGATATAAGGCCAAT1027                          AlaPheGlyTrpAspTrpIleIleAlaProLysArgTyrLysAlaAsn                              295300305                                                                     TACTGCTCAGGAGAGTGTGAATTTGTGTTTTTACAAAAATATCCGCAT1075                          TyrCysSerGlyGluCysGluPheValPheLeuGlnLysTyrProHis                              310315320                                                                     ACTCATCTTGTGCACCAAGCAAACCCCAGAGGCTCAGCAGGCCCTTGC1123                          ThrHisLeuValHisGlnAlaAsnProArgGlySerAlaGlyProCys                              325330335340                                                                  TGCACTCCGACAAAAATGTCTCCCATTAATATGCTATATTTTAATGGC1171                          CysThrProThrLysMetSerProIleAsnMetLeuTyrPheAsnGly                              345350355                                                                     AAAGAACAAATAATATATGGGAAAATTCCAGCCATGGTAGTAGACCGC1219                          LysGluGlnIleIleTyrGlyLysIleProAlaMetValValAspArg                              360365370                                                                     TGTGGGTGCTCATGAGCTTTGCATTAGGTTAGAAACTTCCCAAGTCATGGAAGGTCT1276                 CysGlyCysSer                                                                  375                                                                           TCCCCTCAATTTCGAAACTGTGAATTCAAGCACCACAGGCTGTAGGCCTTGAGTATGCTC1336              TAGTAACGTAAGCACAAGCTACAGTGTATGAACTAAAAGAGAGAATAGATGCAATGGTTG1396              GCATTCAACCACCAAAATAAACCATACTATAGGATGTTGTATGATTTCCAGAGTTTTTGA1456              AATAGATGGAGATCAAATTACATTTATGTCCATATATGTATATTACAACTACAATCTAGG1516              CAAGGAAGTGAGAGCACATCTTGTGGTCTGCTGAGTTAGGAGGGTATGATTAAAAGGTAA1576              AGTCTTATTTCCTAACAGTTTCACTTAATATTTACAGAAGAATCTATATGTAGCCTTTGT1636              AAAGTGTAGGATTGTTATCATTTAAAAACATCATGTACACTTATATTTGTATTGTATACT1696              TGGTAAGATAAAATTCCACAAAGTAGGAATGGGGCCTCACATACACATTGCCATTCCTAT1756              TATAATTGGACAATCCACCACGGTGCTAATGCAGTGCTGAATGGCTCCTACTGGACCTCT1816              CGATAGAACACTCTACAAAGTACGAGTCTCTCTCTCCCTTCCAGGTGCATCTCCACACAC1876              ACAGCACTAAGTGTTCAATGCATTTTCTTTAAGGAAAGAAGAATCTTTTTTTCTAGAGGT1936              CAACTTTCAGTCAACTCTAGCACAGCGGGAGTGACTGCTGCATCTTAAAAGGCAGCCAAA1996              CAGTATTCATTTTTTAATCTAAATTTCAAAATCACTGTCTGCCTTTATCACATGGCAATT2056              TTGTGGTAAAATAATGGAAATGACTGGTTCTATCAATATTGTATAAAAGACTCTGAAACA2116              ATTACATTTATATAATATGTATACAATATTGTTTTGTAAATAAGTGTCTCCTTTTATATT2176              TACTTTGGTATATTTTTACACTAATGAAATTTCAAATCATTAAAGTACAAAGACATGTCA2236              TGTATCACAAAAAAGGTGACTGCTTCTATTTCAGAGTGAATTAGCAGATTCAATAGTGGT2296              CTTAAAACTCTGTATGTTAAGATTAGAAGGTTATATTACAATCAATTTATGTATTTTTTA2356              CATTATCAACTTATGGTTTCATGGTGGCTGTATCTATGAATGTGGCTCCCAGTCAAATTT2416              CAATGCCCCACCATTTTAAAAATTACAAGCATTACTAAACATACCAACATGTATCTAAAG2476              AAATACAAATATGGTATCTCAATAACAGCTACTTTTTTATTTTATAATTTGACAATGAAT2536              ACATTTCTTTTATTTACTTCAGTTTTATAAATTGGAACTTTGTTTATCAAATGTATTGTA2596              CTCATAGCTAAATGAAATTATTTCTTACATAAAAATGTGTAGAAACTATAAATTAAAGTG2656              TTTTCACATTTTTGAAAGGC2676                                                      (2) INFORMATION FOR SEQ ID NO:12:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 376 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (v) FRAGMENT TYPE: internal                                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12:                                      MetMetGlnLysLeuGlnMetTyrValTyrIleTyrLeuPheMetLeu                              151015                                                                        IleAlaAlaGlyProValAspLeuAsnGluGlySerGluArgGluGlu                              202530                                                                        AsnValGluLysGluGlyLeuCysAsnAlaCysAlaTrpArgGlnAsn                              354045                                                                        ThrArgTyrSerArgIleGluAlaIleLysIleGlnIleLeuSerLys                              505560                                                                        LeuArgLeuGluThrAlaProAsnIleSerLysAspAlaIleArgGln                              65707580                                                                      LeuLeuProArgAlaProProLeuArgGluLeuIleAspGlnTyrAsp                              859095                                                                        ValGlnArgAspAspSerSerAspGlySerLeuGluAspAspAspTyr                              100105110                                                                     HisAlaThrThrGluThrIleIleThrMetProThrGluSerAspPhe                              115120125                                                                     LeuMetGlnAlaAspGlyLysProLysCysCysPhePheLysPheSer                              130135140                                                                     SerLysIleGlnTyrAsnLysValValLysAlaGlnLeuTrpIleTyr                              145150155160                                                                  LeuArgProValLysThrProThrThrValPheValGlnIleLeuArg                              165170175                                                                     LeuIleLysProMetLysAspGlyThrArgTyrThrGlyIleArgSer                              180185190                                                                     LeuLysLeuAspMetSerProGlyThrGlyIleTrpGlnSerIleAsp                              195200205                                                                     ValLysThrValLeuGlnAsnTrpLeuLysGlnProGluSerAsnLeu                              210215220                                                                     GlyIleGluIleLysAlaLeuAspGluAsnGlyHisAspLeuAlaVal                              225230235240                                                                  ThrPheProGlyProGlyGluAspGlyLeuAsnProPheLeuGluVal                              245250255                                                                     LysValThrAspThrProLysArgSerArgArgAspPheGlyLeuAsp                              260265270                                                                     CysAspGluHisSerThrGluSerArgCysCysArgTyrProLeuThr                              275280285                                                                     ValAspPheGluAlaPheGlyTrpAspTrpIleIleAlaProLysArg                              290295300                                                                     TyrLysAlaAsnTyrCysSerGlyGluCysGluPheValPheLeuGln                              305310315320                                                                  LysTyrProHisThrHisLeuValHisGlnAlaAsnProArgGlySer                              325330335                                                                     AlaGlyProCysCysThrProThrLysMetSerProIleAsnMetLeu                              340345350                                                                     TyrPheAsnGlyLysGluGlnIleIleTyrGlyLysIleProAlaMet                              355360365                                                                     ValValAspArgCysGlyCysSer                                                      370375                                                                        (2) INFORMATION FOR SEQ ID NO:13:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 2743 base pairs                                                   (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Human GDF-8                                                        (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 59...1183                                                       (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:                                      AAGAAAAGTAAAAGGAAGAAACAAGAACAAGAAAAAAGATTATATTGATTTTAAAATC58                  ATGCAAAAACTGCAACTCTGTGTTTATATTTACCTGTTTATGCTGATT106                           MetGlnLysLeuGlnLeuCysValTyrIleTyrLeuPheMetLeuIle                              151015                                                                        GTTGCTGGTCCAGTGGATCTAAATGAGAACAGTGAGCAAAAAGAAAAT154                           ValAlaGlyProValAspLeuAsnGluAsnSerGluGlnLysGluAsn                              202530                                                                        GTGGAAAAAGAGGGGCTGTGTAATGCATGTACTTGGAGACAAAACACT202                           ValGluLysGluGlyLeuCysAsnAlaCysThrTrpArgGlnAsnThr                              354045                                                                        AAATCTTCAAGAATAGAAGCCATTAAGATACAAATCCTCAGTAAACTT250                           LysSerSerArgIleGluAlaIleLysIleGlnIleLeuSerLysLeu                              505560                                                                        CGTCTGGAAACAGCTCCTAACATCAGCAAAGATGTTATAAGACAACTT298                           ArgLeuGluThrAlaProAsnIleSerLysAspValIleArgGlnLeu                              65707580                                                                      TTACCCAAAGCTCCTCCACTCCGGGAACTGATTGATCAGTATGATGTC346                           LeuProLysAlaProProLeuArgGluLeuIleAspGlnTyrAspVal                              859095                                                                        CAGAGGGATGACAGCAGCGATGGCTCTTTGGAAGATGACGATTATCAC394                           GlnArgAspAspSerSerAspGlySerLeuGluAspAspAspTyrHis                              100105110                                                                     GCTACAACGGAAACAATCATTACCATGCCTACAGAGTCTGATTTTCTA442                           AlaThrThrGluThrIleIleThrMetProThrGluSerAspPheLeu                              115120125                                                                     ATGCAAGTGGATGGAAAACCCAAATGTTGCTTCTTTAAATTTAGCTCT490                           MetGlnValAspGlyLysProLysCysCysPhePheLysPheSerSer                              130135140                                                                     AAAATACAATACAATAAAGTAGTAAAGGCCCAACTATGGATATATTTG538                           LysIleGlnTyrAsnLysValValLysAlaGlnLeuTrpIleTyrLeu                              145150155160                                                                  AGACCCGTCGAGACTCCTACAACAGTGTTTGTGCAAATCCTGAGACTC586                           ArgProValGluThrProThrThrValPheValGlnIleLeuArgLeu                              165170175                                                                     ATCAAACCTATGAAAGACGGTACAAGGTATACTGGAATCCGATCTCTG634                           IleLysProMetLysAspGlyThrArgTyrThrGlyIleArgSerLeu                              180185190                                                                     AAACTTGACATGAACCCAGGCACTGGTATTTGGCAGAGCATTGATGTG682                           LysLeuAspMetAsnProGlyThrGlyIleTrpGlnSerIleAspVal                              195200205                                                                     AAGACAGTGTTGCAAAATTGGCTCAAACAACCTGAATCCAACTTAGGC730                           LysThrValLeuGlnAsnTrpLeuLysGlnProGluSerAsnLeuGly                              210215220                                                                     ATTGAAATAAAAGCTTTAGATGAGAATGGTCATGATCTTGCTGTAACC778                           IleGluIleLysAlaLeuAspGluAsnGlyHisAspLeuAlaValThr                              225230235240                                                                  TTCCCAGGACCAGGAGAAGATGGGCTGAATCCGTTTTTAGAGGTCAAG826                           PheProGlyProGlyGluAspGlyLeuAsnProPheLeuGluValLys                              245250255                                                                     GTAACAGACACACCAAAAAGATCCAGAAGGGATTTTGGTCTTGACTGT874                           ValThrAspThrProLysArgSerArgArgAspPheGlyLeuAspCys                              260265270                                                                     GATGAGCACTCAACAGAATCACGATGCTGTCGTTACCCTCTAACTGTG922                           AspGluHisSerThrGluSerArgCysCysArgTyrProLeuThrVal                              275280285                                                                     GATTTTGAAGCTTTTGGATGGGATTGGATTATCGCTCCTAAAAGATAT970                           AspPheGluAlaPheGlyTrpAspTrpIleIleAlaProLysArgTyr                              290295300                                                                     AAGGCCAATTACTGCTCTGGAGAGTGTGAATTTGTATTTTTACAAAAA1018                          LysAlaAsnTyrCysSerGlyGluCysGluPheValPheLeuGlnLys                              305310315320                                                                  TATCCTCATACTCATCTGGTACACCAAGCAAACCCCAGAGGTTCAGCA1066                          TyrProHisThrHisLeuValHisGlnAlaAsnProArgGlySerAla                              325330335                                                                     GGCCCTTGCTGTACTCCCACAAAGATGTCTCCAATTAATATGCTATAT1114                          GlyProCysCysThrProThrLysMetSerProIleAsnMetLeuTyr                              340345350                                                                     TTTAATGGCAAAGAACAAATAATATATGGGAAAATTCCAGCGATGGTA1162                          PheAsnGlyLysGluGlnIleIleTyrGlyLysIleProAlaMetVal                              355360365                                                                     GTAGACCGCTGTGGGTGCTCATGAGATTTATATTAAGCGTTCATAACTTCCTAAAAC1219                 ValAspArgCysGlyCysSer                                                         370375                                                                        ATGGAAGGTTTTCCCCTCAACAATTTTGAAGCTGTGAAATTAAGTACCACAGGCTATAGG1279              CCTAGAGTATGCTACAGTCACTTAAGCATAAGCTACAGTATGTAAACTAAAAGGGGGAAT1339              ATATGCAATGGTTGGCATTTAACCATCCAAACAAATCATACAAGAAAGTTTTATGATTTC1399              CAGAGTTTTTGAGCTAGAAGGAGATCAAATTACATTTATGTTCCTATATATTACAACATC1459              GGCGAGGAAATGAAAGCGATTCTCCTTGAGTTCTGATGAATTAAAGGAGTATGCTTTAAA1519              GTCTATTTCTTTAAAGTTTTGTTTAATATTTACAGAAAAATCCACATACAGTATTGGTAA1579              AATGCAGGATTGTTATATACCATCATTCGAATCATCCTTAAACACTTGAATTTATATTGT1639              ATGGTAGTATACTTGGTAAGATAAAATTCCACAAAAATAGGGATGGTGCAGCATATGCAA1699              TTTCCATTCCTATTATAATTGACACAGTACATTAACAATCCATGCCAACGGTGCTAATAC1759              GATAGGCTGAATGTCTGAGGCTACCAGGTTTATCACATAAAAAACATTCAGTAAAATAGT1819              AAGTTTCTCTTTTCTTCAGGTGCATTTTCCTACACCTCCAAATGAGGAATGGATTTTCTT1879              TAATGTAAGAAGAATCATTTTTCTAGAGGTTGGCTTTCAATTCTGTAGCATACTTGGAGA1939              AACTGCATTATCTTAAAAGGCAGTCAAATGGTGTTTGTTTTTATCAAAATGTCAAAATAA1999              CATACTTGGAGAAGTATGTAATTTTGTCTTTGGAAAATTACAACACTGCCTTTGCAACAC2059              TGCAGTTTTTATGGTAAAATAATAGAAATGATCGACTCTATCAATATTGTATAAAAAGAC2119              TGAAACAATGCATTTATATAATATGTATACAATATTGTTTTGTAAATAAGTGTCTCCTTT2179              TTTATTTACTTTGGTATATTTTTACACTAAGGACATTTCAAATTAAGTACTAAGGCACAA2239              AGACATGTCATGCATCACAGAAAAGCAACTACTTATATTTCAGAGCAAATTAGCAGATTA2299              AATAGTGGTCTTAAAACTCCATATGTTAATGATTAGATGGTTATATTACAATCATTTTAT2359              ATTTTTTTACATGATTAACATTCACTTATGGATTCATGATGGCTGTATAAAGTGAATTTG2419              AAATTTCAATGGTTTACTGTCATTGTGTTTAAATCTCAACGTTCCATTATTTTAATACTT2479              GCAAAAACATTACTAAGTATACCAAAATAATTGACTCTATTATCTGAAATGAAGAATAAA2539              CTGATGCTATCTCAACAATAACTGTTACTTTTATTTTATAATTTGATAATGAATATATTT2599              CTGCATTTATTTACTTCTGTTTTGTAAATTGGGATTTTGTTAATCAAATTTATTGTACTA2659              TGACTAAATGAAATTATTTCTTACATCTAATTTGTAGAAACAGTATAAGTTATATTAAAG2719              TGTTTTCACATTTTTTTGAAAGAC2743                                                  (2) INFORMATION FOR SEQ ID NO:14:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 375 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (v) FRAGMENT TYPE: internal                                                   (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:                                      MetGlnLysLeuGlnLeuCysValTyrIleTyrLeuPheMetLeuIle                              151015                                                                        ValAlaGlyProValAspLeuAsnGluAsnSerGluGlnLysGluAsn                              202530                                                                        ValGluLysGluGlyLeuCysAsnAlaCysThrTrpArgGlnAsnThr                              354045                                                                        LysSerSerArgIleGluAlaIleLysIleGlnIleLeuSerLysLeu                              505560                                                                        ArgLeuGluThrAlaProAsnIleSerLysAspValIleArgGlnLeu                              65707580                                                                      LeuProLysAlaProProLeuArgGluLeuIleAspGlnTyrAspVal                              859095                                                                        GlnArgAspAspSerSerAspGlySerLeuGluAspAspAspTyrHis                              100105110                                                                     AlaThrThrGluThrIleIleThrMetProThrGluSerAspPheLeu                              115120125                                                                     MetGlnValAspGlyLysProLysCysCysPhePheLysPheSerSer                              130135140                                                                     LysIleGlnTyrAsnLysValValLysAlaGlnLeuTrpIleTyrLeu                              145150155160                                                                  ArgProValGluThrProThrThrValPheValGlnIleLeuArgLeu                              165170175                                                                     IleLysProMetLysAspGlyThrArgTyrThrGlyIleArgSerLeu                              180185190                                                                     LysLeuAspMetAsnProGlyThrGlyIleTrpGlnSerIleAspVal                              195200205                                                                     LysThrValLeuGlnAsnTrpLeuLysGlnProGluSerAsnLeuGly                              210215220                                                                     IleGluIleLysAlaLeuAspGluAsnGlyHisAspLeuAlaValThr                              225230235240                                                                  PheProGlyProGlyGluAspGlyLeuAsnProPheLeuGluValLys                              245250255                                                                     ValThrAspThrProLysArgSerArgArgAspPheGlyLeuAspCys                              260265270                                                                     AspGluHisSerThrGluSerArgCysCysArgTyrProLeuThrVal                              275280285                                                                     AspPheGluAlaPheGlyTrpAspTrpIleIleAlaProLysArgTyr                              290295300                                                                     LysAlaAsnTyrCysSerGlyGluCysGluPheValPheLeuGlnLys                              305310315320                                                                  TyrProHisThrHisLeuValHisGlnAlaAsnProArgGlySerAla                              325330335                                                                     GlyProCysCysThrProThrLysMetSerProIleAsnMetLeuTyr                              340345350                                                                     PheAsnGlyLysGluGlnIleIleTyrGlyLysIleProAlaMetVal                              355360365                                                                     ValAspArgCysGlyCysSer                                                         370375                                                                        (2) INFORMATION FOR SEQ ID NO:15:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 34 base pairs                                                     (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: #83                                                                (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 1..34                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:15:                                      CGCGGATCCGTGGATCTAAATGAGAACAGTGAGC34                                          (2) INFORMATION FOR SEQ ID NO:16:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 37 base pairs                                                     (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: #84                                                                (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 1..37                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16:                                      CGCGAATTCTCAGGTAATGATTGTTTCCGTTGTAGCG37                                       (2) INFORMATION FOR SEQ ID NO:17:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 20 base pairs                                                     (B) TYPE: nucleic acid                                                        (C) STRANDEDNESS: single                                                      (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: Genomic DNA                                               (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: #100                                                               (ix) FEATURE:                                                                 (A) NAME/KEY: CDS                                                             (B) LOCATION: 1..20                                                           (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:                                      ACACTAAATCTTCAAGAATA20                                                        (2) INFORMATION FOR SEQ ID NO:18:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 123 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: GDF-1                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..123                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18:                                      ArgProArgArgAspAlaGluProValLeuGlyGlyGlyProGlyGly                              151015                                                                        AlaCysArgAlaArgArgLeuTyrValSerPheArgGluValGlyTrp                              202530                                                                        HisArgTrpValIleAlaProArgGlyPheLeuAlaAsnTyrCysGln                              354045                                                                        GlyGlnCysAlaLeuProValAlaLeuSerGlySerGlyGlyProPro                              505560                                                                        AlaLeuAsnHisAlaValLeuArgAlaLeuMetHisAlaAlaAlaPro                              65707580                                                                      GlyAlaAlaAspLeuProCysCysValProAlaArgLeuSerProIle                              859095                                                                        SerValLeuPhePheAspAsnSerAspAsnValValLeuArgGlnTyr                              100105110                                                                     GluAspMetValValAspGluCysGlyCysArg                                             115120                                                                        (2) INFORMATION FOR SEQ ID NO:19:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 118 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: BMP-2                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..118                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:                                      ArgGluLysArgGlnAlaLysHisLysGlnArgLysArgLeuLysSer                              151015                                                                        SerCysLysArgHisProLeuTyrValAspPheSerAspValGlyTrp                              202530                                                                        AsnAspTrpIleValAlaProProGlyTyrHisAlaPheTyrCysHis                              354045                                                                        GlyGluCysProPheProLeuAlaAspHisLeuAsnSerThrAsnHis                              505560                                                                        AlaIleValGlnThrLeuValAsnSerValAsnSerLysIleProLys                              65707580                                                                      AlaCysCysValProThrGluLeuSerAlaIleSerMetLeuTyrLeu                              859095                                                                        AspGluAsnGluLysValValLeuLysAsnTyrGlnAspMetValVal                              100105110                                                                     GluGlyCysGlyCysArg                                                            115                                                                           (2) INFORMATION FOR SEQ ID NO:20:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 118 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: BMP-4                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..118                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:20:                                      LysArgSerProLysHisHisSerGlnArgAlaArgLysLysAsnLys                              151015                                                                        AsnCysArgArgHisSerLeuTyrValAspPheSerAspValGlyTrp                              202530                                                                        AsnAspTrpIleValAlaProProGlyTyrGlnAlaPheTyrCysHis                              354045                                                                        GlyAspCysProPheProLeuAlaAspHisLeuAsnSerThrAsnHis                              505560                                                                        AlaIleValGlnThrLeuValAsnSerValAsnSerSerIleProLys                              65707580                                                                      AlaCysCysValProThrGluLeuSerAlaIleSerMetLeuTyrLeu                              859095                                                                        AspGluTyrAspLysValValLeuLysAsnTyrGlnGluMetValVal                              100105110                                                                     GluGlyCysGlyCysArg                                                            115                                                                           (2) INFORMATION FOR SEQ ID NO:21:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 119 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Vgr-1                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..119                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21:                                      SerArgGlySerGlySerSerAspTyrAsnGlySerGluLeuLysThr                              151015                                                                        AlaCysLysLysHisGluLeuTyrValSerPheGlnAspLeuGlyTrp                              202530                                                                        GlnAspTrpIleIleAlaProLysGlyTyrAlaAlaAsnTyrCysAsp                              354045                                                                        GlyGluCysSerPheProLeuAsnAlaHisMetAsnAlaThrAsnHis                              505560                                                                        AlaIleValGlnThrLeuValHisLeuMetAsnProGluTyrValPro                              65707580                                                                      LysProCysCysAlaProThrLysLeuAsnAlaIleSerValLeuTyr                              859095                                                                        PheAspAspAsnSerAsnValIleLeuLysLysTyrArgAsnMetVal                              100105110                                                                     ValArgAlaCysGlyCysHis                                                         115                                                                           (2) INFORMATION FOR SEQ ID NO:22:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 119 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: OP-1                                                               (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..119                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22:                                      LeuArgMetAlaAsnValAlaGluAsnSerSerSerAspGlnArgGln                              151015                                                                        AlaCysLysLysHisGluLeuTyrValSerPheArgAspLeuGlyTrp                              202530                                                                        GlnAspTrpIleIleAlaProGluGlyTyrAlaAlaTyrTyrCysGlu                              354045                                                                        GlyGluCysAlaPheProLeuAsnSerTyrMetAsnAlaThrAsnHis                              505560                                                                        AlaIleValGlnThrLeuValHisPheIleAsnProGluThrValPro                              65707580                                                                      LysProCysCysAlaProThrGlnLeuAsnAlaIleSerValLeuTyr                              859095                                                                        PheAspAspSerSerAsnValIleLeuLysLysTyrArgAsnMetVal                              100105110                                                                     ValArgAlaCysGlyCysHis                                                         115                                                                           (2) INFORMATION FOR SEQ ID NO:23:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 119 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: BMP-5                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..119                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23:                                      SerArgMetSerSerValGlyAspTyrAsnThrSerGluGlnLysGln                              151015                                                                        AlaCysLysLysHisGluLeuTyrValSerPheArgAspLeuGlyTrp                              202530                                                                        GlnAspTrpIleIleAlaProGluGlyTyrAlaAlaPheTyrCysAsp                              354045                                                                        GlyGluCysSerPheProLeuAsnAlaHisMetAsnAlaThrAsnHis                              505560                                                                        AlaIleValGlnThrLeuValHisLeuMetPheProAspHisValPro                              65707580                                                                      LysProCysCysAlaProThrLysLeuAsnAlaIleSerValLeuTyr                              859095                                                                        PheAspAspSerSerAsnValIleLeuLysLysTyrArgAsnMetVal                              100105110                                                                     ValArgSerCysGlyCysHis                                                         115                                                                           (2) INFORMATION FOR SEQ ID NO:24:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 120 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: BMP-3                                                              (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..120                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24:                                      GluGlnThrLeuLysLysAlaArgArgLysGlnTrpIleGluProArg                              151015                                                                        AsnCysAlaArgArgTyrLeuLysValAspPheAlaAspIleGlyTrp                              202530                                                                        SerGluTrpIleIleSerProLysSerPheAspAlaTyrTyrCysSer                              354045                                                                        GlyAlaCysGlnPheProMetProLysSerLeuLysProSerAsnHis                              505560                                                                        AlaThrIleGlnSerIleValArgAlaValGlyValValProGlyIle                              65707580                                                                      ProGluProCysCysValProGluLysMetSerSerLeuSerIleLeu                              859095                                                                        PhePheAspGluAsnLysAsnValValLeuLysValTyrProAsnMet                              100105110                                                                     ThrValGluSerCysAlaCysArg                                                      115120                                                                        (2) INFORMATION FOR SEQ ID NO:25:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 116 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: MIS                                                                (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..116                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:25:                                      GlyProGlyArgAlaGlnArgSerAlaGlyAlaThrAlaAlaAspGly                              151015                                                                        ProCysAlaLeuArgGluLeuSerValAspLeuArgAlaGluArgSer                              202530                                                                        ValLeuIleProGluThrTyrGlnAlaAsnAsnCysGlnGlyValCys                              354045                                                                        GlyTrpProGlnSerAspArgAsnProArgTyrGlyAsnHisValVal                              505560                                                                        LeuLeuLeuLysMetGlnAlaArgGlyAlaAlaLeuAlaArgProPro                              65707580                                                                      CysCysValProThrAlaTyrAlaGlyLysLeuLeuIleSerLeuSer                              859095                                                                        GluGluArgIleSerAlaHisHisValProAsnMetValAlaThrGlu                              100105110                                                                     CysGlyCysArg                                                                  115                                                                           (2) INFORMATION FOR SEQ ID NO:26:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 122 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Inhibin-alpha                                                      (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..122                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26:                                      AlaLeuArgLeuLeuGlnArgProProGluGluProAlaAlaHisAla                              151015                                                                        AsnCysHisArgValAlaLeuAsnIleSerPheGlnGluLeuGlyTrp                              202530                                                                        GluArgTrpIleValTyrProProSerPheIlePheHisTyrCysHis                              354045                                                                        GlyGlyCysGlyLeuHisIleProProAsnLeuSerLeuProValPro                              505560                                                                        GlyAlaProProThrProAlaGlnProTyrSerLeuLeuProGlyAla                              65707580                                                                      GlnProCysCysAlaAlaLeuProGlyThrMetArgProLeuHisVal                              859095                                                                        ArgThrThrSerAspGlyGlyTyrSerPheLysTyrGluThrValPro                              100105110                                                                     AsnLeuLeuThrGlnHisCysAlaCysIle                                                115120                                                                        (2) INFORMATION FOR SEQ ID NO:27:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 122 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Inhibin-beta-alpha                                                 (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..122                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:27:                                      HisArgArgArgArgArgGlyLeuGluCysAspGlyLysValAsnIle                              151015                                                                        CysCysLysLysGlnPhePheValSerPheLysAspIleGlyTrpAsn                              202530                                                                        AspTrpIleIleAlaProSerGlyTyrHisAlaAsnTyrCysGluGly                              354045                                                                        GluCysProSerHisIleAlaGlyThrSerGlySerSerLeuSerPhe                              505560                                                                        HisSerThrValIleAsnHisTyrArgMetArgGlyHisSerProPhe                              65707580                                                                      AlaAsnLeuLysSerCysCysValProThrLysLeuArgProMetSer                              859095                                                                        MetLeuTyrTyrAspAspGlyGlnAsnIleIleLysLysAspIleGln                              100105110                                                                     AsnMetIleValGluGluCysGlyCysSer                                                115120                                                                        (2) INFORMATION FOR SEQ ID NO:28:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 121 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: Inhibin-beta-beta                                                  (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..121                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:                                      HisArgIleArgLysArgGlyLeuGluCysAspGlyArgThrAsnLeu                              151015                                                                        CysCysArgGlnGlnPhePheIleAspPheArgLeuIleGlyTrpAsn                              202530                                                                        AspTrpIleIleAlaProThrGlyTyrTyrGlyAsnTyrCysGluGly                              354045                                                                        SerCysProAlaTyrLeuAlaGlyValProGlySerAlaSerSerPhe                              505560                                                                        HisThrAlaValValAsnGlnTyrArgMetArgGlyLeuAsnProGly                              65707580                                                                      ThrValAsnSerCysCysIleProThrLysLeuSerThrMetSerMet                              859095                                                                        LeuTyrPheAspAspGluTyrAsnIleValLysArgAspValProAsn                              100105110                                                                     MetIleValGluGluCysGlyCysAla                                                   115120                                                                        (2) INFORMATION FOR SEQ ID NO:29:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 115 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: TGF-beta-1                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..115                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:                                      HisArgArgAlaLeuAspThrAsnTyrCysPheSerSerThrGluLys                              151015                                                                        AsnCysCysValArgGlnLeuTyrIleAspPheArgLysAspLeuGly                              202530                                                                        TrpLysTrpIleHisGluProLysGlyTyrHisAlaAsnPheCysLeu                              354045                                                                        GlyProCysProTyrIleTrpSerLeuAspThrGlnTyrSerLysVal                              505560                                                                        LeuAlaLeuTyrAsnGlnHisAsnProGlyAlaSerAlaAlaProCys                              65707580                                                                      CysValProGlnAlaLeuGluProLeuProIleValTyrTyrValGly                              859095                                                                        ArgLysProLysValGluGlnLeuSerAsnMetIleValArgSerCys                              100105110                                                                     LysCysSer                                                                     115                                                                           (2) INFORMATION FOR SEQ ID NO:30:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 115 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: TGF-beta-2                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..115                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:30:                                      LysLysArgAlaLeuAspAlaAlaTyrCysPheArgAsnValGlnAsp                              151015                                                                        AsnCysCysLeuArgProLeuTyrIleAspPheLysArgAspLeuGly                              202530                                                                        TrpLysTrpIleHisGluProLysGlyTyrAsnAlaAsnPheCysAla                              354045                                                                        GlyAlaCysProTyrLeuTrpSerSerAspThrGlnHisSerArgVal                              505560                                                                        LeuSerLeuTyrAsnThrIleAsnProGluAlaSerAlaSerProCys                              65707580                                                                      CysValSerGlnAspLeuGluProLeuThrIleLeuTyrTyrIleGly                              859095                                                                        LysThrProLysIleGluGlnLeuSerAsnMetIleValLysSerCys                              100105110                                                                     LysCysSer                                                                     115                                                                           (2) INFORMATION FOR SEQ ID NO:31:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 115 amino acids                                                   (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: protein                                                   (vii) IMMEDIATE SOURCE:                                                       (B) CLONE: TGF-beta-3                                                         (ix) FEATURE:                                                                 (A) NAME/KEY: Protein                                                         (B) LOCATION: 1..115                                                          (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31:                                      LysLysArgAlaLeuAspThrAsnTyrCysPheArgAsnLeuGluGlu                              151015                                                                        AsnCysCysValArgProLeuTyrIleAspPheArgGlnAspLeuGly                              202530                                                                        TrpLysTrpValHisGluProLysGlyTyrTyrAlaAsnPheCysSer                              354045                                                                        GlyProCysProTyrLeuArgSerAlaAspThrThrHisSerThrVal                              505560                                                                        LeuGlyLeuTyrAsnThrLeuAsnProGluAlaSerAlaSerProCys                              65707580                                                                      CysValProGlnAspLeuGluProLeuThrIleLeuTyrTyrValGly                              859095                                                                        ArgThrProLysValGluGlnLeuSerAsnMetValValLysSerCys                              100105110                                                                     LeuCysSer                                                                     115                                                                           (2) INFORMATION FOR SEQ ID NO:32:                                             (i) SEQUENCE CHARACTERISTICS:                                                 (A) LENGTH: 4 amino acids                                                     (B) TYPE: amino acid                                                          (D) TOPOLOGY: linear                                                          (ii) MOLECULE TYPE: peptide                                                   (ix) FEATURE:                                                                 (A) NAME/KEY: Peptide                                                         (B) LOCATION: 1..118                                                          (D) OTHER INFORMATION: /note= "Xaa at positions 2 and 3 is                    any amino acid"                                                               (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32:                                      ArgXaaXaaArg                                                                  1                                                                             __________________________________________________________________________

We claim:
 1. Substantially pure growth differentiation factor-8 (GDF-8)having the amino acid sequence as set forth in SEQ ID NO:12 or SEQ IDNO:14.
 2. An isolated polynucleotide encoding the GDF-8 polypeptidehaving the amino acid sequence as set forth in SEQ ID NO:12 or SEQ IDNO:14.
 3. An isolated polynucleotide selected from the group consistingof:a) SEQ ID NO:11; b) SEQ ID NO:13; c) SEQ ID NO:11, wherein T can alsobe U; d) SEQ ID NO:13, wherein T can also be U; e) nucleic sequencescomplementary to SEQ ID NO:11; f) nucleic sequences complementary to SEQID NO:13; g) fragments of a) c), or e) that are at least 15 bases inlength and that will selectively hybridize to genomic DNA which encodesthe GDF-8 protein of SEQ ID NO:12; and h) fragments of b), d), or f)That are at least 15 bases in length and that will selectively hybridizeto genomic DNA which encodes the GDF-8 protein of SEQ ID NO:14.
 4. Thepolynucleotide of claim 2, wherein the polynucleotide is isolated from amammalian cell.
 5. The polynucleotide of claim 4, wherein the mammaliancell is selected from the group consisting of mouse, rat, and humancell.
 6. An expression vector including the polynucleotide of claim 2.7. The vector of claim 6, wherein the vector is a plasmid.
 8. The vectorof claim 6, wherein the vector is a viral vector.
 9. A host cellcontaining the vector of claim
 6. 10. The host cell of claim 9, whereinthe cell is prokaryotic.
 11. The host cell of claim 9, wherein the cellis eukaryotic.